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Abstract 



The notion of communication complexity was introduced by Yao in his sem- 
inal paper [Yao79j . In |BFS86j . Babai Frankl and Simon developed a rich 
structure of communication complexity classes to understand the relation- 
ships between various models of communication complexity. This made it 
apparent that communication complexity was a self-contained mini-world 
within complexity theory. In this thesis, we study the place of regular lan- 
guages within this mini-world. In particular, we are interested in the non- 
deterministic communication complexity of regular languages. 

We show that a regular language has either 0(1) or f2(logn) non-determi- 
nistic complexity. We obtain several linear lower bound results which cover a 
wide range of regular languages having linear non-deterministic complexity. 
These lower bound results also imply a result in semigroup theory: we obtain 
sufficient conditions for not being in the positive variety Pol(Com). 

To obtain our results, we use algebraic techniques. In the study of regular 
languages, the algebraic point of view pioneered by Eilenberg ( [Eil74] ) has led 
to many interesting results. Viewing a semigroup as a computational device 
that recognizes languages has proven to be prolific from both semigroup 
theory and formal languages perspectives. In this thesis, we provide further 
instances of such mutualism. 
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Resume 



La notion de complexite de communication a d'abord ete introduite par Yao 
[Yao79] . Les travaux fondateurs de Babai et al. [BFS86] ont devoile une 
riche structures de classes de complexite de communication qui permettent 
de mieux comprendre la puissance de divers modles de complexite de commu- 
nication. Ces resultats ont fait de la complexite de communication une sorte 
de maquette petite chelle du monde de la complexit. Dans ce memoire, 
nous etudions la place des langages reguliers dans cette maquette. Plus 
precisement, nous chercherons determiner la complexite de communication 
non-deterministe de ces langages. 

Nous montrons qu'un langage regulier a une complexite de communi- 
cation so it O(logn), soit Q(\ogn). Nous etablissons de plus des bornes 
inferieures lineaires sur la complexite non-deterministe d'une vaste classe 
de langages. Celles-ci fournissent egalement des conditions sufnsantes pour 
qu'un langage donne n'appartienne pas la variete positive Pol(Com). 

Nos resultats se basent sur des techniques algebriques. Dans l'etude des 
langages reguliers, le point de vue algebrique, developpe initialement par 
Eilenberg |Eil74j s'est revele comme un outil central. En effet, on peut voir 
un semigroupe fini comme une machine capable de reconnaitre des langages 
et cette perspective a permis des avancees tant en theorie des semigroupes 
qu'en theorie des langages formels. Dans ce memoire, nous etablissons de 
nouveaux exemples de ce mutualisme. 
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Chapter 1 



Introduction 



1.1 Computational Complexity Theory 

The theory of computation is one of the fundamental branches of computer 
science that is concerned with the computability and complexity of prob- 
lems in different computational models. Computability theory focuses on 
the question of whether a problem can be solved in a certain computational 
model. On the other hand, complexity theory seeks to determine how much 
resource is sufficient and necessary for a computable problem to be solved in 
a computational model. 

Simply put, a computing device is a machine that performs calculations 
automatically: it can be as complicated as a personal computer and as simple 
as an automatic door. In theoretical computer science, a computational model 
is a pure mathematical definition which models a real-world computing de- 
vice. This abstraction is necessary in order to rigorously study computation, 
its power and limitations. The most studied computational model (which 
physically corresponds to the everyday computers we use) is the Turing Ma- 
chine. The most studied resources are time and space (memory) measured 
with respect to the input size. Based on these resources, different complexity 
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classes can be defined. For instance, P and NP are classes of problems that 
can be solved in polynomial (in the input size) time using a deterministic and 
a non-deterministic Turing Machine respectively. Whether these classes are 
equal or not is without a doubt one of the biggest open questions in computer 
science and mathematics. 

There are various interesting computational models including (but not 
limited to) Turing Machines, finite automata, context-free grammars, boolean 
circuits and branching programs. Their countless applications span com- 
puter science. For instance, when designing a new programming language 
one would find grammars useful. Finite automata and regular languages 
have applications in string searching and pattern matching. When trying to 
come up with an efficient algorithm, the theory of NP-completeness can be 
insightful. Many cryptographic protocols rely on theoretical principles. All 
these applications aside, the mathematical elegance and aesthetic inherent 
in theory of computation is enough to attract many minds around the world. 
And perhaps the main reason that computer science is called a "science" is 
because of the study of theoretical foundations of computer science. 

Despite the intense efforts of many researchers, our understanding and 
knowledge of computational complexity is quite limited. Similar to the P 
versus NP question, there are many other core questions (in different com- 
putational models) that beg to be answered. The focus of research in com- 
plexity theory is twofold. Given a certain problem, a computational model 
and a resource: 

• What is the maximum amount of resource we need to solve the problem 
in the computational model? 

• What is the minimum amount of resource we need to solve the problem 
in the computational model? 

The ultimate goal is to find matching upper and lower bounds. The first 



3 



CHAPTER 1. Introduction 



question can be answered by depicting a method^ of solving the problem and 
analyzing the amount of resource this method consumes. Almost always, 
the more challenging question is the second one. Proving results of the form 
"Problem p requires at least x resource." requires us to argue against all 
possible methods that solve the problem. In most computational models, 
this is intrinsically hard. Yet it should be also noted that complexity theory 
is a relatively new field and therefore can be considered as an amenable 
discipline of mathematics. 



1.2 Communication Complexity 

In this thesis, we will be studying a computational model which emulates 
distributed computing: communication protocols. In this model, there are 
usually two computers that are trying to collaboratively evaluate the value 
of a given function. The difficulty is that the input is distributed among the 
two computers in a predetermined adversarial way so that neither computer 
can evaluate the value of the function by itself. Therefore, in order to deter- 
mine the value of the function, these computers need to communicate over 
a network. The communication will be carried out according to a protocol 
that has been agreed upon beforehand. The resource we are interested in is 
the number of bits that is communicated i.e. we would like to determine the 
communication complexity of a given function. 

As an example, consider two files that reside in two computers. Suppose 
we wanted to know if these two files were copies of each other. How many 
bits would the computers need to communicate in order to conclude that the 
files are the same or not? What is the best protocol for the computers to 
accomplish this task? 

Note that the scenario here is quite different from information theory. In 

In the Turing Machine computational model, the method is called an algorithm. 
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information theory, the goal is to robustly transmit a predetermined mes- 
sage through a noisy channel and there is no function to be computed. In 
the communication complexity setting, the channel of communication is not 
noisy. What is sent through the channel is determined by the protocol and 
it usually changes according to the inputs of the computers and the commu- 
nication history. 

There are various models for communication complexity. The first de- 
fined was the 2-player deterministic model. Since then, non-deterministic, 
randomized, multi-party, distributional, simultaneous and many more mod- 
els have been defined and analyzed. 

Although the mathematical theory of communication complexity was first 
introduced in light of its applications to parallel computers ([Yao79]), it 
has been shown to have many more applications where the need for com- 
munication is not explicit. These applications include time/space lower 
bounds for VLSI chips ( |KN97j ). time/space tradeoffs for Turing Machines 
( [BNS92] ). data structures ( [KN97] ). boolean circuit lower bounds QGro92j, 
|HG91j . [Nis93] . [RM97j ). pseudorandomness ( |BNS92j ). separation of proof 
systems ( |BPS07j ) and lower bounds on the size of polytopes representing 
iVP-complete problems ( |Yan91j ). 

1.3 Algebraic Automata Theory 

One of the fundamental (and simplest) computational models is the finite 
automaton and it is usually the first model in theory of computation that 
computer science students are introduced to. The word "finite" refers to the 
memory of the machine and finite automata are models for computers with 
an extremely limited amount of memory (for example an automatic door). 
Even though it is a quite limited model, its well-known applications include 
text processing, compilers and hardware design. 
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In a nutshell, finite automata are abstract machines such that given a 
word over some alphabet as an input, it either accepts or rejects the word 
after processing each letter of the word sequentially. The set of all words that 
a finite automaton accepts is called the language corresponding to the finite 
automaton and we say that the language is recognized by this automaton. A 
language recognized by some finite automaton is called a regular language. 

Algebra has always been an important tool in the study of computational 
complexity. In the study of regular languages, semigroup theorjo has been 
the dominant tool. It should be mentioned that semigroups have shed new 
light not only on regular languages but on computational theory in general. 
On top of this, it is also true that computational theory has led to advances 
in the study of semigroup theory ( |TT04j ). 

The link between semigroups and regular languages has been established 
by viewing a semigroup as a computational machine that accepts/rejects 
words over some alphabet. In this context, it is not difficult to prove that 
the family of languages that finite semigroups recognize is exactly the regular 
languages. In fact, the connection between finite automata and semigroups 
is much more profound. There are several reasons why this point of view is 
beneficial. First of all, the semigroup approach to regular languages allows 
one to use tools from semigroup theory while investigating the properties of 
these languages. Eilenberg showed that there is a one to one correspondence 
between certain robust and natural classes of languages and semigroups. 
This has organized and heightened our understanding of regular languages. 
Furthermore, in certain computational models, the complexity of a regular 
language can be parametrized by the complexity of the corresponding semi- 
group and so this provides us alternate avenues to analyze the complexity of 
regular languages. Often the combinatorial descriptions of regular languages 
suffice to obtain upper bounds on their complexity. The algebraic point of 



2 A semigroup is a set equipped with a binary associative operation. 
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view proves to be useful when proving hardness results. Communication 
complexity is one of the computational models where this is the case. 

1.4 Outline 

In this thesis, we study the non-deterministic communication complexity of 
regular languages. The ultimate goal is to find functions /i(n), /2(n), /fc(n) 
such that each regular language has 0(/j(n)) non-deterministic communica- 
tion complexity for some % G {1, 2, k}. Furthermore, we would like a char- 
acterization of the languages with 0(/j(n)) complexity for alH e {1, 2, k}. 
In |TT03] . this goal was reached for the following communication mod- 
els: deterministic, simultaneous, probabilistic, simultaneous probabilistic and 
Modp-counting. Obtaining a similar result for the non-deterministic model 
requires a refinement of the techniques used in [TT03j. 

The study of the non-deterministic communication complexity of regular 
languages from an algebraic point of view is important for several reasons. 
We can summarize it by stating that it increases our understanding of regular 
languages and non-deterministic communication complexity. 

From regular languages perspective, our results yield sufficient algebraic 
conditions for not being in a certain class of languages. This is an interesting 
result within algebraic automata theory. Furthermore, given the fact that 
communication complexity has many ties with other computational models, 
understanding the communication complexity of regular languages helps us 
understand the power of regular languages in different computational models 
and where they stand within the complexity theory frame. 

From a communication complexity perspective, there are several inter- 
esting consequences. In |TT03j . it was shown that in the regular languages 
setting, O (log log n) probabilistic communication complexity coincides with 
O(logn) simultaneous communication complexity. Results about the non- 
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deterministic communication complexity leads to further such correspon- 
dences which allows us the compare different communication models within 
the regular languages framework. Even though regular languages are "sim- 
ple" with respect to Turing Machines for example, they provide a non-trivial 
case-study of non-deterministic communication complexity since there are 
both "hard" and "easy" regular languages with respect to this model. There- 
fore, a complete characterization of regular languages in this model is likely 
to force one to develop new lower bound techniques and study functions (for 
example promise functions) other than the commonplace ones which have 
been intensively studied. 

Through the notion of programs over monoids ( |Bar86j ). a connection be- 
tween algebraic automata theory and circuit complexity has been formed. For 
example algebraic characterizations of some of the most studied circuit classes 
AC , ACC° and NC 1 have been obtained ([BT87J). The connection between 
communication complexity and circuit complexity is well known. Currently, 
techniques from communication complexity provide one of the most powerful 
tools for proving circuit lower bounds ( [Cro92j . [HG9T] . |Nis93j . [RM97j ) . Al- 
gebraic characterization of regular languages with respect to communication 
complexity completes a full circle and further strengthens our understanding 
of the three fields. 




Communication Complexity 



Algebraic Automata Theory 




Circuits 
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The breakdown of the thesis is as follows. In Chapter 2, we give an in- 
troduction to communication complexity and present the fundamental tech- 
niques in this field. Chapter 3 is devoted to the basics of algebraic automata 
theory. The main purpose of these two chapters is to deliver the background 
material needed for Chapter 4. In Chapter 4, we present the results obtained 
about the non-deterministic communication complexity of regular languages. 
Finally we conclude in Chapter 5. 



Chapter 2 



Communication Complexity 



In this chapter, we present the notion of communication complexity as in- 
troduced by Yao in [Yao79] . We start in Section 2.1 with the deterministic 
model in which we look at the fundamental concepts. In Section 2.2, we move 
to the non-deterministic model which is the model of interest for this work. 
In Section 2.3, we briefly mention other popular communication models. In 
Section 2.4, we introduce the notion of a reduction which plays a key role 
in our arguments in Chapter 4. We also define communication complexity 
classes and see a beautiful analogy between these classes and Turing Machine 
classes. Finally we summarize this chapter in Section 2.5. 

We refer the reader to the much celebrated book by Kushilevitz and Nisan 
|KN97j for an in depth survey of the subject. One can also find and excellent 
introduction in the lecture notes by Ran Raz [Raz04j . We mostly use the 
notation used in |KN97j . 
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2.1 Deterministic Model 
2.1.1 Definition 

In the two-party communication complexity model, we have two players (usu- 
ally referred to as Alice and Bob) and a function / : X x Y — > Z. Alice is 
given x G X and Bob is given y £ Y. Both know the function / and their 
goal is to collaboratively compute f(x,y) i.e. they both want to know the 
value f(x,y). In order to do this they have to communicate (for most func- 
tions) since neither of them see the whole input. We are only interested in 
the number of bits that they need to communicate to compute f(x, y). Thus 
the complexity of their individual computations are irrelevant and we assume 
that both Alice and Bob have unlimited computational power. 

The communication of Alice and Bob is carried out according to a protocol 
V that both players have agreed upon beforehand. The protocol V specifies 
in each step the value of the next bit communicated as a function of the input 
of the player who sends it and the sequence of previously communicated bits 
by the two players. The protocol also determines who sends the next bit as 
a function of the bits communicated thus far. 

More formally, a protocol is a 5-tuple of functions (ca, cb, n, /a, /b) such 
that: 

• At each step of the communication, ca takes as input the communica- 
tion history thus far and the input for Alice and returns the bit that 
Alice will communicate (similarly for cb and Bob). 

• n takes as input the communication history thus far and decides whether 
the communication is over or not. If not, it decides who speaks next. 

• After the communication is over, Ja takes as input the communication 
history and the input for Alice and returns one bit (similarly for fs 
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and Bob). This bit is the output of the protocol and the values of /a 
and f B should be the same. 

Unless stated otherwise, the functions we consider in this chapter are of 
the form / : {0, l} n x {0, l} n — > {0, 1}. Since the output of the function is 
just one bit, we can assume that the last bit communicated is this value. 

Let V(x,y) be the output of the protocol V, i.e. the last bit commu- 
nicated. Then we say that V is a protocol for / if for all x,y e {0, 1}™, 
V(x,y) = f(x,y). The cost of V is defined as 

costCP) := max number of bits communicated for (x,y). 

(x,y)€XxY 

The deterministic communication complexity of a function /, denoted as 
D(f), is defined as 



An obvious upper bound for D(EQ) is n + 1 since one of the players, say 
Alice, can just send all her bits to Bob and Bob can simply compare the 
string he has with the string Alice has sent. If the strings are equal he can 
send 1 to Alice and otherwise he can send 0. In fact, this protocol gives an 
upper bound for any boolean function. Once one of the players knows all the 
input, s/he can compute the value of the function and send this value to the 
other player. The number of bits communicated is n + 1. 

Intuitively, one expects that D(EQ) — n + 1, i.e. n + 1 is also a lower 
bound for D(EQ). Although this intuition is correct, how can one rigorously 
prove this lower bound? 



D(f): 



V protocol for / 



min cost(V) 



Example 2.1. Define the EQUALITY function as 
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Figure 2.1: Example of an input matrix and a 0-monochromatic rectangle. 
2.1.2 Lower Bound Techniques 

As in any other computational model, proving tight lower bounds for the 
complexity of explicit functions in the communication model is usually a 
non-trivial task. Nevertheless, there are a number of effective techniques one 
can use to accomplish this. Now we explore three of these methods: the 
disjoint cover method, the rectangle size method and the fooling set method. 

For a function /, define the input matrix by A' xy = f(x, y) where the rows 
are indexed by x G X and the columns are indexed by y G Y. We say that 
R is a rectangle if R = S x T for some SCI and TCF. This is equivalent 
to saying that (xi,yi) G R and (x 2 ,y2) G R together imply (xi,y 2 ) G R. We 
say that R is monochromatic with respect to / if for some z G Z we have 
A f xy = z for all (x, y) G R (see Figure [2X2]) . 

Given /, let V be a protocol for /. For simplicity let us assume that the 
players send bits alternately. Also without loss of generality we can assume 
Alice (who has the input x) sends the first bit. Thus at step 1, the protocol 
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partitions X x Y = (X x Y) U (Xi x Y) such that 

Wx G X , Alice sends 0, 

Wx E Xi, Alice sends 1. 

At the second step it is Bob's turn to send a bit so the protocol partitions 
both X xY = (X x F 00 ) U (X Q x Y 01 ) and X l x F = (X x x F 10 ) U (X x x F n ). 
Here, if the first communicated bit was a 0, then 

\/y G loo, Bob sends 0, 

\/y G Yoi? Bob sends 1, 
and if the first communicated bit was a 1, then 

\/y G Y"io, Bob sends 0, 

\/y G Y"n, Bob sends 1. 

In general, if it is Alice's turn to speak and the bits communicated thus far 
are h,b 2 , ...b k , then Alice partitions xF bli ... i6fe into X 6l) .., )6fej0 xY bl ^ bk 

and Xb lv ..,b k ,i x Ybi,...,b k - A protocol partitioning tree nicely illustrates what 
happens (see Figure [2T2l . 

Observe that each node in the protocol partitioning tree is a rectangle 
and two nodes intersect if and only if one is the ancestor of the other. In 
particular, the leaves of the tree are disjoint rectangles. The same bits are 
communicated for all the inputs in a leaf so V(x, y) is the same for all these 
inputs, i.e. the leaves are monochromatic. The height of the tree is equal to 
cost(V). Thus we have proved the following lemma which is a key combina- 
torial property of a protocol. 

Lemma 2.1. A protocol V for f with cost(V) = c partitions the input matrix 
A' into at most 2 C monochromatic rectangles. 
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Figure 2.2: Protocol partitioning tree. 
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A monochromatic disjoint cover is a partition of a matrix into disjoint 
monochromatic rectangles. We denote by C D (f), the minimum number of 
rectangles in any monochromatic disjoint cover of A? . With this definition 
and the previous lemma at hand, we can present the first lower bound tech- 
nique. 

Corollary 2.2 (Disjoint Cover Method). For a function f we have D(f) > 
log 2 C D (f). 

With this tool it is now easy to show a linear lower bound for D(EQ). 
Observe that the input matrix for EQUALITY is a 2 n by 2 n identity ma- 
trix. No 1-monochromatic rectangle can contain more than one 1. Thus 
any monochromatic disjoint cover has 2 n 1-monochromatic rectangles and at 
least one O-monochromatic rectangle. So D(EQ) > |~log 2 (2 n + 1)] = n + 1 
as predicted. 

Although every protocol for a function induces a monochromatic disjoint 
cover of the input matrix, simple examples show that the converse is not 
true. So if some of the monochromatic disjoint covers do not correspond to 
any protocol, how good can the disjoint cover method be? The next theorem 
states that the gap is not very large. 

Theorem 2.3. For a function f, we have D(f) = 0(\og 2 2 C D (f)). 

Proof. For any function /, we present a protocol for it with complexity 
0(log2 C D (f)). The protocol consists of at most \og 2 C D (f) rounds and in 
each round at most \og 2 C D (f) + 0(1) bits are communicated. The basic 
idea is as follows: 

Alice and Bob agree upon an optimal disjoint monochromatic cover before- 
hand. They try to figure out whether (x, y) lies in a O-monochromatic rect- 
angle or a 1-monochromatic rectangle. The protocol proceeds in rounds. If 
f(x, y) = 1 then in each round they successfully eliminate at least half of the 
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O-monochromatic rectangles. At the end, all O-monochromatic rectangles are 
eliminated and they conclude f(x,y) = 1. If on the other hand f(x,y) = 0, 
then in one of the rounds they are not able to eliminate at least half of the 
O-monochromatic rectangles. At this point they conclude f(x,y) = 0. 
Before giving the details of a round, we make two crucial observations. The 
first observation implies the second one. The correctness of the protocol fol- 
lows from the second observation. 

Observation 1: Suppose Rq = So x To is a O-monochromatic rectangle and 
Ri — Si x T\ is a 1-monochromatic rectangle. Then either Rq and R\ are 
disjoint in rows or they are disjoint in columns, i.e. either So and Si are 
disjoint or T and T\ are disjoint. 

Observation 2: Let C be any collection of O-monochromatic rectangles and 
Ri any 1-monochromatic rectangle. Then either 

- Ri intersects with at most half of the rectangles in C in rows or 

- Ri intersects with at most half of the rectangles in C in columns. 

Otherwise there is at least one rectangle R in C such that R and i?i intersect 
both in rows and columns. This contradicts the first observation. 
Now we can describe how a round is carried out. Initially C contains all the 
O-monochromatic rectangles. 

A. If C = then Alice communicates to Bob that f(x, y) = 1 and the pro- 
tocol ends. Otherwise, Alice tries to find a 1-monochromatic rectangle 
R\ — Si x T\ such that x G Si and R\ intersects with at most half 
of the rectangles in C in rows. If such a rectangle exists, then Alice 
sends its name (log 2 C D (/) bits) to Bob and they both update C so 
it contains all the rectangles that intersect with R\ in rows (the other 
rectangles cannot contain (x,y)). At this point the round is over since 
they successfully eliminated at least half of the rectangles in C. If Alice 
is unable find such a rectangle then she communicates this to Bob. 
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B. At this point we know Alice could not find a 1-monochromatic rect- 
angle to end the round so Bob tries to end the round by finding a 
1-monochromatic rectangle Ri = Si x Ti such that y E T x and R\ 
intersects with at most half of the rectangles in C in columns. If he 
finds such a rectangle, he communicates its name to Alice and they 
both update C so it contains all the rectangles that intersect with Ri 
in columns. After this point the round is over. If he cannot find such a 
rectangle this means both Alice and Bob failed and therefore he com- 
municates to Alice that f(x, y) = because by the second observation, 
he knows that there is no 1-monochromatic rectangle containing (x, y) . 

□ 

In most cases it is hard to exactly determine C D (f). So the natural next 
step is to find lower bounds on C D (f) which in turn gives lower bounds on 
D(f). (This is actually what we did for the EQUALITY function.) 

An obvious way of bounding (from below) the number of monochromatic 
rectangles needed in a monochromatic disjoint cover is to bound (from above) 
the size of every monochromatic rectangle. In other words, if every monochro- 
matic rectangle in the input matrix has size less than or equal to s, then we 
need at least 2 2n /s monochromatic rectangles in a monochromatic disjoint 
cover of the matrix. Here 'size' refers to the number of pairs (x, y) in the 
rectangle and we can interpret this as a measure \i. The above actually 
generalizes to any kind of measure. 

Proposition 2.4 (Rectangle Size Method). Let fi be a measure defined on 
the space X x Y. If all monochromatic rectangles R (with respect to f) are 
such that n(R) < s, then D(f) > \og 2 (fi(X x Y)/s). 

In particular, if fi is a probability and every monochromatic rectangle R sat- 
isfies fi(R) < e, then D(f) > log 2 1/e. 
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Example 2.2. Let us see an application of the rectangle size method by 
proving a linear lower bound for the communication complexity of the DIS- 
JOINTNESS function. We define DISJOINTNESS as 

1 if x n y = 0, 
otherwise. 

where x and y are viewed as subsets of [n] (xi — 1 if x contains the element 
i £ [n]). We claim that any 1-monochromatic rectangle R = S x T has size 
at most 2 n . It is easy to show that the number of (x, y)'s such that xDy — 
is Y^=o (")2 n ~- ? = 3 n . Now if for all x and y that intersect we set /j,(x, y) — 
and for all x and y that are disjoint we set = 1 then x F) = 3 n 

and the above claim together with Proposition 12.41 imply D(DISJ) = Q(n). 
Proof of claim: Suppose \S\ = k and | U x( zsx\ = c. Then clearly k < 2 C . Also 
|T| < 2 n ~ c since every set in T must be disjoint from every set in S. Thus 
the size of the rectangle is |5| • |T| < k2 n ~ c < 2 c 2 n ~ c = 2 n . 

The last lower bound technique we look at in this section is the well- 
known fooling set technique. It is a direct consequence of the disjoint cover 
method and in fact it is a special case of Proposition 12.41 First we make the 
formal definition of a fooling set. 

Definition 2.5. A set F C X x Y is a fooling set for / if the following 
conditions are satisfied. 

1. For all (x, y) G F, f(x, y) = z for some z E Z. 

2. For all distinct (x^yt), (x 2 ,y 2 ) G F either f{x l ,y 2 ) ^ z or f{x 2 ,yi) ^ 
z. 

By the definition of a fooling set, no two elements in F can be in the 
same monochromatic rectangle. Therefore there must be at least \F\ many 
monochromatic rectangles in any monochromatic disjoint cover of the input 
matrix. So by Corollary 12.21 we get the following fact. 



DISJ(x,y) :-- 
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Lemma 2.6 (Fooling Set Method). If F is a fooling set for f then D(f) > 



To see that the fooling set method is indeed a special case of Proposition 
I2.4[ for a fooling set F, let y) = c > for every (x, y) G F and for every 
(x, y) ^ F set fi(x,y) = 0. Then any monochromatic rectangle R satisfies 
fi(R) < c and therefore 



where x and y are viewed as binary numbers. We can show that LT has 
linear deterministic communication complexity by the fooling set technique. 
Let F = {(x, x) : x G {0, l} n }. It is easy to see that F is a fooling set. 
Clearly \F\ = 2 n and this proves our claim. In fact F is also a fooling set for 
the EQUALITY function. 

From our discussion in this section, it is clear that we can exploit the nice 
combinatorial structure of protocols to prove tight lower bounds for explicit 
functions. In the next section, we see that most of the techniques seen in 
this section can be applied to the non-deterministic model as well. 



iog 2 m 



D(f) > logMX x Y)/c) = log 2 (c|F|/c) = log 2 \F 



Example 2.3. Define the LESS-THAN function as 




2.2 Non-Deterministic Model 



2.2.1 Definition 



The definition of the non-deterministic communication model is analogous to 
the non-deterministic model in the Turing Machine world. There are several 
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ways of defining non-determinism, all of which are equivalent. Here we will 
present the one that best suits our needs. 

Intuitively, non-determinism can be viewed as a certificate verification 
process^: A third player (referred to as God) gives a proof (bit string) that 
f(x, y) = z to both Alice and Bob. If indeed f(x, y) = z, then Alice and Bob 
must be able to convince themselves that this is the case by communicating 
with each other. If on the other hand f(x,y) ^ z, then the verification pro- 
cess should fail and Alice and Bob should be able to conclude that the proof 
was wrong. We consider the bits sent by God as a part of the communicated 
bits. 

More formally, in the non-deterministic setting, Alice and Bob communi- 
cate according to a non-deterministic protocol V z . This protocol differs from 
the deterministic one as follows. V z takes three inputs, x, y and s, where 
x and y are perceived as the inputs for Alice and Bob respectively, and s 
is some bit string which we think of as the "proof string". V z specifies in 
each step the value of the next bit communicated as a function of the input 
of the player who sends it, the sequence of previously communicated bits 
as well as s. It also determines who will send the next bit as a function of 
the communicated bits thus far. So it differs from a deterministic protocol 
because what a player sends also depends on the string s. We will denote 
the output of the protocol by V z (x, y, s). 

We say that V z is a non-deterministic protocol for / if for all (x, y) such 
that f(x, y) = z, there exists a string s such that V z (x, y, s) = z, and for all 
(x, y) such that f(x, y) ^ z we have V z (x, y,s) ^ z for any s. 

The the cost of V z is defined as 



1 Equivalently one can view it as a communication game in which the players are allowed 
to take non-deterministic steps. 



cost(V z ) : 



= max 



0,2/): 
f(x,y)=z 



min \s\ + no of communicated bits for (x,y, s). 

V z (x,y,s)=z 
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Figure 2.3: An example of a monochromatic cover of the 1-inputs. 
We define the non- deterministic communication complexity of / as 



The co-non-deterministic communication complexity of / is defined similarly 
and is denoted by N°(f). 

Example 2.4. Let us show N°(DISJ) = 0(log 2 n) by exhibiting a proof 
and a verification protocol. God can prove DISJ(x, y) = by telling Alice 
and Bob the index i in which x and y intersect. This proof is 0(log 2 n) bits 
long and Alice and Bob can convince themselves that the proof is correct by 
exchanging the bits Xj and y^ If DISJ(x,y) = 1, then given any index as a 
proof, Alice and Bob can detect that the proof is wrong. (Any other kind of 
proof is considered as a wrong proof.) 

Unlike the deterministic communication complexity we can get an exact 
characterization of non-deterministic communication complexity in terms of 
monochromatic rectangles. We denote by C z (f) the minimum number of 
z-monochromatic rectangles in any monochromatic cover of the z-inputs of 
/ (observe that here we dropped the word "disjoint" since we allow the 
rectangles to intersect). This quantity exactly determines N z (f). 



N\f) : 



mm 

V 1 non-deterministic protocol for / 



cost{V l ) 



Proposition 2.7. \og 2 C z (f) < N z (f) < \og 2 C z (f) + 2. 



Proof. 
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. teg 2 C*(f) < N*(f) 

We have seen in the deterministic case that a certain communication 
pattern corresponds to a certain monochromatic rectangle. This situa- 
tion is not much different in the non-deterministic model. In this case, 
what Alice and Bob send in each step also depends on the proof bits. So 
for every fixed proof string, there corresponds a protocol partitioning 
tree as in Figure 12.21 

Now observe that for this particular proof, every communication pat- 
tern that convinces Alice and Bob leads to a z-monochromatic rectan- 
gle. (Other communication patterns may not lead to a monochromatic 
rectangle since the proof we fixed may not be a proof for all (x, y) with 
f(x,y) = z.) So each convincing communication pattern (including 
the proof) corresponds to a ^-monochromatic rectangle. Since for ev- 
ery (x, y) such that f(x,y) = z there must be a proof that convinces 
Alice and Bob, all the convincing communication patterns together cor- 
respond to a covering of the ^-inputs. Here the rectangles are allowed 
to intersect since for some (x, y) with f(x, y) = z, there might be more 
than one proof that leads Alice and Bob to be convinced. There are at 
most 2 N *^> communication patterns and therefore C z (f) < 2 NZ ^>. 

• N z (f) <log 2 C*(/) + 2 

Fix any optimal monochromatic cover of the ^-inputs. If God sends 
Alice and Bob the name of a monochromatic rectangle R = S x T that 
(x, y) lies in, then Alice can check that x G S and if so, she can send 1 
to Bob. Bob can similarly check if y e T and send 1 to Alice if this is 
the case. 



□ 
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2.2.2 Power of Non-Determinism 

A natural question that arises in this context is: how much power does 
non-determinism give? Non-determinism in the finite automaton computa- 
tional model does not give extra power with respect to the class of languages 
recognized. In the Turing Machine model, it is not known whether non- 
determinism provides significantly more power. In the communication com- 
plexity model we can answer this question and prove that non-determinism 
is strictly more powerful. First we observe that the gap between determinism 
and non-determinism cannot be more than exponential. 

Proposition 2.8. For any z G {0, 1}, D(f) < C z (f) + 1. 

Proof. Alice and Bob agree on an optimal cover of the ^-inputs. Alice com- 
municates to Bob the ^-monochromatic rectangles that x lies in (this requires 
C z (f) bits of communication). Bob, with this information, can determine if 
there is a z-monochromatic rectangle that (x, y) lies in and send the answer 
to Alice. □ 

The above is actually tight. For example the EQUALITY function sat- 
isfies D(EQ) = n + 1 and N°(EQ) < log 2 n + 2 (similar protocol to the one 
in Example 12.41) . 

Can it be the case that both N°(f) and N l (f) are exponentially smaller 
than D(f)7 The answer to this question is given by the next theorem. 

Theorem 2.9. For every function f : X xY — >{0,1} ; 

D(f) = 0(N°(f)N\f)). 

The proof of this theorem is the same as the proof of Theorem 12.31 It 
was shown in |Fur87j that this bound is tight. 

Observe that there are two reasons why non-determinism is more powerful 
than determinism: 
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1. non-determinism is one sided in the sense that we only need to cover 
the z-inputs, 

2. the ^-monochromatic rectangles in the cover are allowed to overlap. 

From our discussion above it should be clear that the ultimate power comes 
from the first point. In the EQUALITY example we see that it is "easy" to 
cover the 0-inputs in the sense that we do not need exponentially many 0- 
monochromatic rectangles to cover the 0-inputs. The hardness lies in covering 
the 1-inputs. The exponential gap is a product of this fact. The power of a 
cover against a disjoint cover is only quadratic as implied by Theorem 12.91 

2.2.3 Lower Bound Techniques 

In the deterministic model, we saw the rectangle size method as a lower 
bound technique. It is clear that the same approach gives a lower bound 
for the non-deterministic communication complexity as well. If every z- 
monochromatic rectangle has size less than or equal to s and there are k 
^-inputs, then we need at least k/s many rectangles to cover these inputs. 
The non-deterministic version of Proposition 12.41 is as follows. 

Proposition 2.10. Let K C X x Y be the set of all z-inputs and let fx be a 

measure defined on the space K . If all z -monochromatic rectangles R satisfy 
H{R) < s, then N z {f) > log 2 (ji(K)/s). 

It can be shown that the rectangle size method in the non-deterministic 
case is almost tight. Suppose we choose the best possible measure fi (i.e. the 
one that gives the best bound) and the maximum size (with respect to //) of 
a z- mono chromatic rectangle is s. Then we have: 



Theorem 2.11 (see |KN97j ). N z (f) < log 2 (p(K) / s) + log 2 n + 0(1). 
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There are examples that show that we cannot do better than this. 

The fact that we can use the rectangle size method here implies that we 
can also use the fooling set method. However, as the next proposition shows, 
the quality of the fooling set method is questionable. 

Proposition 2.12 (see |KN97j ). Almost all functions f : {0, l} n x {0, l} n -> 

{0, 1} satisfy A^ 1 (/) = Q(n) but the size of their largest fooling set is 0(n). 

We finish off this section by looking at the non-deterministic communica- 
tion complexity of the PROMISE-DISJOINTNESS function. This function 
is defined the same way as the DISJOINTNESS function but the input space 
is different: it is the union of the following two sets A and B. 



In other words Alice and Bob are promised that if they get an input that 
intersects, then the size of the intersection is exactly 1. 

Now we show that the PROMISE-DISJOINTNESS (PDISJ) function 
has linear non-deterministic complexity. This fact is used in Chapter [4] to 
prove linear lower bounds for the complexity of certain regular languages. 
To show the linear lower bound, we use a result that implies a linear lower 
bound on the randomized communication complexity of the DISJOINTNESS 
function. Before we can state this result, we first need to define two measures 



A := {{x, y) e {0, 1}" X {0, l} n : x D y = 0}. 



B := {(x,y) e {0,l} n x {0,l} n : \xny\ = 1}. 



on {0, l} n x {0,1}". 
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Lemma 2.13 (see |Raz04j ). For any rectangle R = S x T , if 

V A {R) > 2~ n ' WQ 

then 

(for n large enough). 

In particular, if ^a(R) > 2~ n / 100 then R contains elements from both J. 
and B. Therefore to cover the inputs in A with 1-monochromatic rectangles^, 
we need exponentially many rectangles. This shows N 1 (PDISJ) = Q(n). 



2.3 Other Models 

In this section, we mention some of the most interesting and well-studied 
communication complexity models. 



Randomized Communication Complexity 

In the randomized setting, Alice and Bob both have access to random bit 
strings that are generated according to some probability distribution. These 
random strings are private to them and are independent. What Alice and 
Bob communicate depends on these random strings as well as their input 
and the previously communicated bits. We say that V is a protocol for / 
with e error if the following holds. 

V(x, y) e X x Y, Pt[P(x, y) = f(x, y)\ > 1 - e 

The cost of V is defined as the maximum number of bits communicated where 
the maximum is taken over all possible random strings and all inputs (x, y) . 

2 In this setting 1-monochromatic rectangles can contain any element from (X x Y)\B 
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The randomized communication complexity of / is 

R(f) '■= min cost(V). 

V protocol for / with error 1/3 

One can also define the one sided error randomized complexity. We say 
that V is a protocol for / with one sided e error if the following holds. 

V(x, y) E X x Y with f(x, y) = 0, Pr[V(x, y) = 0] = 1 and 

V(x, y) e X x Y with f(x, y) = 1, Pr[P(x, y) = 1] > 1 - e. 
Then the one sided randomized communication complexity of / is 

B}(f) := min cost(V). 

V protocol for / with one sided error 1/2 

There are also variations of the randomized model in which Alice and Bob 
have access to one public random string. (For a comparison see |New91j .) 

Distributional Communication Complexity 

In this setting, the definition of the cost of a protocol and the communica- 
tion complexity of a function are the same as the deterministic model. The 
difference is that we relax the condition 

W(x,y)eXxY, V{x,y) = f(x,y) 

to 

PTp[P{x,y) = f(x,y)] > 1-e 

for a given probability distribution fi on the input space XxY and a constant 
e. The distributional communication complexity of a function is denoted by 
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Multiparty Communication Complexity 

A natural way of generalizing the two player model to fc-players is as follows, 
fc-players try to compute a function / : X\ x X2 x ... x Xk — > Z where 
player i gets xi G X, and communication is established by broadcasting 
(every player receives the communicated bit). Observe that as the number 
of players increases, the power of the model decreases. 

Another way of generalizing the two party model to k players was pro- 
posed in |CFL83j . This model is referred to as "number on the forehead" 
model because here each player i sees every input but X{. This can be viewed 
as each player having their input on their forehead and not being able to see 
it. The power of this model increases as the number of players increases. In 
this setting, coming up with lower bounds is considerably harder. However, 
these lower bounds imply lower bounds in other computational models such 
as circuits and bounded-width branching programs. This is one of the rea- 
sons why this model has attracted more interest than the natural generaliza- 
tion mentioned previously. There are applications in time-space tradeoffs for 
Turing Machines QBNS92J), length- width tradeoffs for branching programs 
([BNS92]), circuit complexity QHC91| . [Cro92j . [Nii93] . |Cro98j ). proof com- 
plexity ([B PS07] ) and pseudorandom generators ( |BNS92j ). to cite only a 
few. 

2.4 Communication Complexity Classes 

It is possible to define complexity classes with respect to communication com- 
plexity once we settle what it means to be "easy" or "tractable" . Communica- 
tion complexity classes were introduced in [BFS86] in which "tractable" was 
defined to be polylog(n) complexity. That is, a function is tractable if its com- 
plexity is 0(log c n) for some constant c. From this foundation, one can build 
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communication complexity classes analogous to P, NP, coNP, BPP, RP and 
many more. For example P cc = {/ : D(f) = polylog(n)}. The correspon- 
dence between some of the complexity classes and the complexity measures 
can be summarized as follows. 



Complexity class 


pcc 


N pcc 


coNP cc 


B ppcc 


RP CC 


Complexity measure 


D 


N 1 


N° 


R 


R l 



The relationship between these classes are much better known than their 
Turing Machine counterparts since proving lower bounds for explicit func- 
tions is easier in the communication world. We have seen that the function 
NOT-EQUALITY satisfies D(NEQ) = n + 1 and N\NEQ) < log 2 n + 1. 
This proves P cc ^ NP CC . Since N°(NEQ) =0{n), we have coNP cc ^ NP CC . 

Theorem 12.91 shows that P cc = NP CC n coNP cc . It can also be shown that 
pcc ^ R pcc and N pcc £ B ppcc 

Remark. It is also possible to define analogs of the polynomial hierarchy. 

Reducibility and completeness are fundamental concepts in the Turing 
Machine computational model so it is natural to define the communication 
complexity analogs. 

The idea of reduction is as follows. Given two functions / and g, f reduces 
to g if Alice and Bob can privately convert their inputs x and y to x' and y' 
such that f(x, y) — 1 if and only if g(x', y') = 1. Suppose / reduces to g and 
that the inputs of length n are converted into inputs of length t(n). Then it 
is clear that if the communication complexity of g is 0(h(n)) then the com- 
munication complexity of / is 0(h(t(n))). If the communication complexity 
of / is Q(h(n)) then the communication complexity of g is f2(/i(t _1 (n))). 

Reductions of particular interest with respect to the communication com- 
plexity classes are those with t(n) = 2 log 2 n for some constant c. The formal 
definition as given in [BFS86] is as follows. 

Definition 2.14. Let t = 2 log 2 n for some constant c. A rectangular reduction 
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from a function / : {0, l} n x {0, l} n — > {0,1} to a function g : {0,1}* x 
{0, 1}' -»■ {0, 1} is a pair of functions a : {0, 1}™ -> {0, 1}' and 6 : {0, 1}" 
{0, 1}' such that /(a;, = 1 if and only if g(a(x), b(y)) = 1. 

From this definition it is clear that if there is a rectangular reduction from 
/ to g and g G P cc then / G P cc . The same is true for NP CC (and in fact for 
every level of the polynomial hierarchy). 

Example 2.5. For a fixed constant q > 1, define the INNER-PRODUCT 
function as follows. 



We exhibit a reduction from PDISJ to JP g such that an input of length n 
is converted into an input of length n + q. Since g is a constant, this proves 
that N l (IP q ) = tt(n). 

Given x and y, Alice and Bob each append q l's at the end of their inputs 
to obtain x' and y' . If PDISJ(x,y) = 1 then clearly IP q (x',y') = 1. If on 
the other hand PDISJ(x,y) = 0, then we know that x and y intersect only 
at one position and therefore x' and y' will intersect in q + 1 positions. This 
implies IP q (x',y') = 0. 

Having established the definition of a reduction, we can define the notion 
of completeness. For a class of functions C, we say / G C is complete in C if 
there is a rectangular reduction from every function in C to /. In [BFS86] , a 
complete function is found in every level of the polynomial hierarchy. 

2.5 Summary 

In this chapter we took a glimpse at the mini-world within complexity the- 
ory: communication complexity. The main focus in this area has been prov- 
ing tight lower bounds for specific functions. We showed three lower bound 




mod q, 
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techniques for the deterministic model. These were the disjoint cover method, 
rectangle size method and the fooling set method. We introduced the non- 
deterministic model and saw that the non-deterministic communication com- 
plexity of a function was essentially the number of z-monochromatic rectan- 
gles needed to cover the z-inputs. We saw that the rectangle size method, 
and therefore the fooling set method were also applicable in this setting. 
We looked at the power of non-determinism and observed that the possible 
exponential gap between the deterministic and the non-deterministic com- 
plexity arose from the fact that non-determinism was one sided. Later we 
touched on some other communication models: randomized complexity, dis- 
tributional complexity and multiparty complexity. Finally we defined some 
of the communication complexity classes, P cc , NP CC , coNP cc , BPP CC , RP CC , 
by considering poly log (n) complexity as tractable. Natural definitions of 
reducibility and completeness were also introduced. 

The deterministic and the non-deterministic communication complexities 
of the functions seen in this chapter are summarized with the following table. 





EQ 


NEQ 


LT 


DISJ 


PDISJ 


IP* 


D 


9(n) 


0(n) 


0(n) 


9(n) 


0(n) 


0(n) 


N 1 


9(n) 


0(log 2 n) 


9(n) 


0(n) 


0(n) 


0(n) 



Chapter 3 

Algebraic Automata Theory 



In this chapter, we introduce the reader to algebraic automata theory by 
presenting the fundamental concepts in this area. The heart of this theory is 
viewing a monoid as a language recognizer. Therefore we begin this chapter 
in Section 3.1 by explaining how a monoid can be viewed as a computational 
machine. Later we define the syntactic monoid of a language which is anal- 
ogous to the minimal automaton. Then we define varieties and state the 
variety theorem which establishes a one to one correspondence between va- 
rieties of finite monoids and varieties of regular languages. This conveys the 
intimate relationship between finite monoids and regular languages. In Sec- 
tion 3.2, we extend the theory to ordered monoids since (as we see in Chapter 
4) this provides the proper framework to analyze the non-deterministic com- 
munication complexity of regular languages. 

We assume that the reader has basic knowledge in automata theory. For 
more details on the subjects covered in this chapter, see [Pin86j and [Pin97j 
for the ordered case. 
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3.1 Monoids - Automata - Regular Languages 

3.1.1 Monoids: A Computational Model 

Before we can present how a monoid can be viewed as a computational 
machine, we first need to formally define a monoid and a morphism between 
monoids. A semigroup (S,-) is a set S together with an associative binary 
operation defined on this set. A monoid (M, •) is a semigroup that has an 
identity: 3 1m G M which satisfies 1m ■ m = m ■ 1m = Tn for any m G M. 
We denote a monoid by its underlying set and write m\m2 instead of m\ ■ irt2 
when there is no ambiguity about the operation. Observe that a group is 
just a monoid in which each element has an inverse. 

Given two monoids M and N, a function ip : M — > N is a morphism if 
<p(1m) — Ijv an d if f preserves the operation, i.e. if (mm') = ip(m)ip(m f ) for 
any m, mf G M. 

We assume that the monoids we are dealing with are finite, with the 
exception of the free monoid £* which consists of all words (including the 
empty word e) over the alphabet S, with the underlying operation being 
concatenation. Observe that any function tp : S — > M extends uniquely to a 
morphism $ : S* — > M. 

One branch of algebraic graph theory studies the connection between 
groups and corresponding Cayley graph representations of the groups. Sim- 
ilarly, monoids also have graph representations. Given a monoid M, we can 
construct a labeled multidigraph G = (V, A) as follows. Let V be the under- 
lying set of the monoid and let (mi, m?) G A with label m^ if m±m^ = m-i- 
See Figure 13.11 for an example. 

Now the correspondence between monoids and automata should be clear 
since we can easily view the graph of M as an automaton which recognizes a 
language over the alphabet M. All we need to do is declare the vertex 1 M as 
the initial state and agree upon a set of accepting vertices F C M. Observe 
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Figure 3.1: Graph of (Z 3 , +). 

that the graph of a monoid accepts a word m 1 m 2 ...m n iff mi -m 2 • ... -m n G F. 
In fact, once we fix a function <p : £ — > M, the graph of M recognizes a 
language over the alphabet S: replace each arc's label by its preimage under 
ip (now an arc can have more than one label). A word SiS2--.s n G X* is 
accepted iff <p(si) ■ (p(s 2 ) ■ ... • (f(s n ) G F. 

If we allow the set of accepting states to vary and the function if : E — > M 
to vary (for fixed S and M) then by viewing the monoid's graph as an 
automaton, we see that a single monoid can be used to recognize a family 
of languages over S. Each language in the family corresponds to a fixed set 
of accepting states and a fixed function tp. This leads to the more formal 
definition of recognition by a monoid. We say that a language L C S* is 
recognized by a finite monoid M if there exists a morphism $ : S* — > M 
and an accepting set F C M such that L = $ _1 (F). Similarly, we say that 
$ : S* — > M recognizes L if there exists F C M such that L = $ _1 (F). See 
Figure 13.21 for an alternative way of viewing M as a language recognizer. 

Given any monoid morphism : M —> N, the nuclear congruence with 
respect to \l/ is denoted by and is defined by m =^ m! if \l/(m) = \l/(m'). 
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Multiply 




In F? 


Yes 


in M 


— ► m ► 


► or 

No 



Figure 3.2: Another way of viewing a monoid as a machine. 

We say that a set of words is homogeneous with respect to L if either every 
word in the set is in L or none of the words is in L. Now observe that 
$ : S* — > M recognizes L if and only if the nuclear congruence classes of $ 
are homogeneous. This fact is used in the upcoming proofs. 

From the earlier discussion, we can conclude that if L is recognized by a 
finite monoid, then it is recognized by a finite automaton (the graph of the 
monoid) and therefore it is regular. In fact, the converse is also true. 

Theorem 3.1. L is regular if and only if a finite monoid recognizes L. 



Proof. If L is regular then it is recognized by a finite automaton. The def- 
inition of an automaton includes the transition function 5 : Q x E — > Q 
where Q is the set of states. This function can be naturally extended to 
5 : Q x X* — > Q. In other words, every word in S* defines a function from 
Q to Q. Let S w : Q — > Q, q i— > S(q,w), be the function corresponding to the 
word u>. Then it is easy to see that the set T := {5 W : w G E*} is a monoid 
with the operation being composition of functions. Furthermore T is finite 
since Q is finite. We call T the transformation monoid of the automaton. 
We claim that the transformation monoid T recognizes L. To see this let 
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$ : X* — > T be the canonical mapping: if i— ► 5^. $ is a morphism since 

= <U = 5 U o 5« = ®(u) o $(f ). 

If $(«) = <3>(i>) then m £ L iff t; e L so the nuclear congruence classes are 
homogeneous and thus $ recognizes L, which means T recognizes L. □ 

Theorem 13.11 constitutes the foundation of algebraic automata theory. It 
shows that finite monoids and finite automata have the same computational 
power with respect to the class of languages recognized. The proof reveals the 
strong link between monoids and automata. In fact, this link can be seen to 
be much stronger via the relation between the combinatorial propertied of L 
and the algebraic properties of a monoid recognizing L. With the purpose of 
exploring this relation, we define the syntactic monoid of a regular language. 



3.1.2 The Syntactic Monoid 

For every regular language there is a minimal automaton that recognizes it. 
Similarly, every regular language has a "minimal" monoid that recognizes it. 
We call this monoid the syntactic monoid and it is unique. 

The syntactic congruence associated with a language L C S* is denoted 
by =l and x =i y if for all u, v G S* we have uxv G L iff uyv G L. It is 
straightforward to check that this relation is indeed a congruence. The syn- 
tactic monoid of L is the quotient monoid E*/ =l and is denoted by M(L). 
Let [w] represent the congruence class of w with respect to the syntactic 
congruence. There is a well-defined operation: [w][u] = [wu], so the canon- 
ical surjective mapping $ : S* — > M(L), w i— > [w], is a morphism. Observe 
that any congruence class of =l is homogeneous (i.e. the nuclear congru- 
ence classes are homogeneous) so $ recognizes L. We call $ the syntactic 
morphism. 

1 Regular languages are definable by regular expressions which are combinatorial de- 
scriptions of the language. 
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It is quite easy to verify the following fact. 

Proposition 3.2. L is regular if and only if its syntactic monoid is finite. 

We say that a monoid A" divides a monoid M (denoted by A" -< M) if there 
exists a surjective morphism from a submonoidj of M onto N. Intuitively, 
this means that the multiplicative structure of A" is embedded in M. The 
syntactic monoid of L recognizes L and is the minimal monoid with this 
property with respect to division. 

Proposition 3.3. M(L) recognizes L and divides any other monoid that also 
recognizes L. 

Proof. We have already proved that M(L) recognizes L so we prove the 
second statement. 

Let M be any monoid that recognizes L. So there exists a morphism 
\& : S* — > M recognizing L. Let $ be the syntactic morphism. To show 
M(L) divides M, we find a surjective morphism T from a submonoid A" of 
M onto M(L). 

Before defining T we first prove the following claim: if ^f(a) = *&(b) 
then <3>(a) = $(6). Suppose not, so there exists a, b such that ^(a) = ^(b) 
but $(a) 7^ By the definition of $ this means that without loss of 

generality, there exists u, v such that uav G L but ubv L. We have 

\f/(uav) = ^(u)^(a)^(v) = W (u)W (b)^ (v) = ^(ubv) 

so ^> maps uav and ubv to the same element. Since nuclear congruence classes 
(with respect to ^) must be homogeneous and uav e L but ubv ^ L, we get 
a contradiction. 

Now let AT := *(£*). So N is a submonoid of M. Define T : N -> M(L), 
ty(w) h-> i.e. $ = Tof. 



2 A submonoid is a subset that contains the identity and is closed under the operation. 
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S* M(L) 

N 

By claim T is well-defined. Since $ is surjective, T is surjective. Fur- 
thermore, 

T(*(u)*(u)) = T(*(uu)) = $(uu) = $(u)$(u) = T(*(u))T(*(u)) 
and so T is a morphism. □ 

\i M ^ N and N -< M then M is isomorphic to iV. So as claimed before, 
for every regular language there is a unique (up to isomorphism) canonical 
monoid, the syntactic monoid, attached to it. An interesting property of 
M(L) is that it is isomorphic to the transformation monoid of the minimal 
automaton recognizing L. 

In the next subsection, we introduce the notion of language and monoid 
varieties. The combinatorial properties of a language are reflected on the 
algebraic properties of M(L) and varieties are the proper framework to for- 
malize this. 




3.1.3 Varieties 

We first give a brief overview of varieties and how monoid varieties and 
language varieties relate to each other. 

A variety of languages is a family of languages that satisfy certain condi- 
tions. Similarly a variety of monoids is a family of monoids satisfying certain 
conditions. The variety theorem states that there is a one to one correspon- 
dence between varieties of regular languages and varieties of finite monoids: 
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a variety of monoids V corresponds to the variety of regular languages V con- 
sisting of all the languages whose syntactic monoid is in V. Consequently, 
we are able to state results of the form: 

"A regular language belongs to the language variety V if and only if its 
syntactic monoid belongs to the monoid variety V." 

Many classes of languages that are defined combinatorially form language 
varieties and many classes of monoids that are defined algebraically form 
monoid varieties. So from above we can hope to reach results of the form: 

"A regular language has the combinatorial property P if and only if its 
syntactic monoid has the algebraic property Q." 

Schiitzenberger was the first to establish such a result: A regular lan- 
guage is star-freqfl if and only if its syntactic monoid is finite and aperiodic 
QSch65j). Several important classes of regular languages admit a similar al- 
gebraic characterization. This often yields decidability results which are not 
known to be obtainable by other means. For instance, by a result of Mc- 
Naughton and Papert ( |MP71j ). we know that regular languages definable 
by a first-order formula are exactly the star-free languages. This implies that 
we can decide if a regular language is first-order definable by checking if its 
syntactic monoid is aperiodic and this is the only known way of doing this. 
These types of algebraic characterizations of regular languages also provide 
one with powerful algebraic tools when analyzing and proving results about 
regular languages. 



Varieties of Finite Monoids 

A variety of finite monoids is a family of finite monoids V that satisfies the 
following two conditions: 

3 A language is star-free if it can be defined by a extended regular expression without 
the Kleene star operation. 

4 A monoid is aperiodic if no subset of it forms a non-trivial group. 
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(i) V is closed under division: if M G V and N -< M then JVG V, 

(ii) V is closed under direct product: if Mi , M 2 G V then M x x M 2 G V. 
Example 3.1. The following are some examples of varieties of monoids: 

• I is the trivial variety consisting of only the trivial monoid / = {1}. 

• M is the variety containing all finite monoids. 

• Com is the variety of all commutative monoids. 

• G is the variety of groups. 

• A is the variety of aperiodic monoids. 

• J is the variety of monoids M that satisfy Mm x M = Mm^M ==>- 
m i — 1^2- We call these monoids j7"-trivial. 

There is a convenient way of defining varieties of monoids through iden- 
tities. The notion of identities can be presented in two ways. One involves 
topological semigroups (see [Pin97] ) . which we wish to avoid. Therefore we 
use the presentation which we think is more intuitive. 

Let £ be a countable alphabet and u, v two words in S*. We say that a 
monoid M satisfies the identity u — v if for all morphisms <p : S* — > M we 
have tp(u) = tp(v). This means that if we replace the letters of u and v with 
arbitrary (but consistent) elements of M then we will arrive at an equality 
in M. For example a monoid is commutative if and only if it satisfies the 
identity ab = ba. 

It can be shown that the family of finite monoids consisting of the monoids 
that satisfy the identity u = v forms a variety. This variety is denoted by 
V(u,v). 
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Now let (u n ,v n ) n> Q be a sequence of pair of words in S*. Define 
W := lim V(u n ,v n ) =110 V(u n ,v n ). 

n— >oo 1 1 

m>0 n>m 

Observe that M G W if and only if there exists an no > such that for every 
n > no, M satisfies u n = v n . Here we say that W is ultimately defined by 
the sequence of identities [u n = v n ) n>0 . 

Theorem 3.4 (see |Pin86j). Every variety of monoids is ultimately defined 
by some sequence of equations. 

For example the variety of commutative monoids is ultimately defined by 
the constant sequence (ab = ba) n> Q. A less trivial example is the variety 
of aperiodic monoids. It can be shown that a finite monoid is aperiodic 
if and only if for each m G M there exists n > such that m n = m n+1 . 
Consequently, the variety of aperiodic monoids is ultimately defined by the 
sequence (a n = a n+1 ) n> o- The variety of commutative aperiodic monoids is 
ultimately defined by the sequence 

a 1 = a 2 , ab = ba, a 2 = a 3 , ab = ba, a 3 = a 4 , ab = ba, ... 

In such a case, for clarity, we say that the variety is ultimately defined by 
two sequences. 

An element m of a monoid is called idempotent if m ■ m = m. In finite 
monoids, idempotents play a key role. For instance, every non-empty monoid 
contains an idempotent. Indeed, if we take any element m of the monoid, 
then there exists a number n > such that m n is an idempotent (in fact 
this is the unique idempotent generated by m). This implies that for any 
finite monoid, there is a number k > such that for every element m in 
the monoid, we have that m k is an idempotent. We call k an exponent of 
M. Observe that if k is an exponent of M then for any n > k, n! is also an 
exponent of M. 
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We use n\ in many sequences of identities that ultimately define varieties 
of monoids. For example, the sequence (x nl yx n[ = x nl ) n>0 ultimately defines 
the variety of locally trivial monoids. From this, it should be clear that a 
monoid M is locally trivial if and only if for every idempotent e G M and 
every element m G M we have erne = e. As a convention, a sequence of 
equations involving n\ is written by replacing n\ with uo. So for example we 
use x u yx u = x w as an abbreviation for (x nl yx nl = x nl ) n> Q. It is easy to see 
that the variety of groups G is ultimately defined by x u = e. 

Given a sequence of identities E, we denote by [[E]} the variety that is 
ultimately defined by E. So for example we have G = [[x" = e]] and the 
variety of locally trivial monoids is [[af x! yx w = x u \\. 

Varieties of Regular Languages 

Before we can define a variety of regular languages we need some preliminary 
definitions. 

A class of regular languages is a function C that maps every alphabet £ 
to a set of regular languages in E*. 

A set of languages in E* that is closed under finite intersection, finite 
union and complementation is called a boolean algebra. 

Now a variety of regular languages is a class of regular languages V that 
satisfies the following conditions: 

(i) For any alphabet E, V(E) is a boolean algebra. 

(ii) V is closed under inverse morphisms: given any alphabets E and T, for 
any morphism $ : £* -> T*, if L G V(r) then G V(E). 

(iii) V is closed under left and right quotients: for L G V(E) and s G E, we 
have s^L := {w G L\sw G L} and Ls' 1 := {w G L\ws G L} are in 
V(S). 



43 



CHAPTER 3. Algebraic Automata Theory 



Example 3.2. The following are some examples of varieties of regular lan- 
guages: 

• The trivial variety: V(E) = {0,E*}. 

• The variety of all regular languages (each alphabet is mapped to all 
the regular languages over this alphabet). 

• The variety of star-free languages. 

• The variety of piecewise testable languages: A language is called piece- 
wise testable if there exists a k G N such that membership of any word 
in the language depends on the set of subwordjf] of length at most k 
occurring in that word. 

The Variety Theorem 

For a given finite monoid variety V, let V(E) be the set of languages in E* 
whose syntactic monoid belongs to V. Alternatively, we can define V as 
follows. 

Proposition 3.5. Let C(S) be the set of languages over E that is recognized 
by a monoid in V. Then V = C. 

Proof. V C C : If L G V then M(L) G V. M(L) recognizes L so L G C. 

C C V : If L G C then there exists M G V recognizing L. M(L) -< M and V 

is closed under division so M(L) G V. Therefore L G V. □ 

Now we can state the variety theorem due to Eilenberg ( [Eil74] ) . 

Theorem 3.6 (The Variety Theorem). V is a variety of languages and the 
mapping V t— > V is one to one. 

5 A word u — a\...a n is a subword of a word x if x = x§a\X\ai---a, n x n for some words 

X\ 5 •••) x n . 
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In light of this theorem, one can hope to explicitly make such correspon- 
dences. Two important correspondence results are the following. 

Theorem 3.7 ([Sch65j). The monoid variety A corresponds to the variety of 
star-free languages. Equivalently, a regular language is star-free if and only 
if its syntactic monoid is aperiodic. 

Theorem 3.8 ( [Sim75j ) . The monoid variety J corresponds to the variety 
of piecewise testable languages. Equivalently, a regular language is piecewise 
testable if and only if its syntactic monoid is J -trivial. 

Furthermore we can restate Theorem 13.11 as follows. 

Theorem 3.9. The monoid variety M corresponds to the variety of all reg- 
ular languages. Equivalently, L is regular if and only if its syntactic monoid 
is finite. 



3.2 Ordered Monoids 

In the previous section, we have seen that we can classify regular languages 
in terms of the monoids that recognize them. We were able to obtain alge- 
braic characterizations for certain classes of languages: varieties of languages. 
Many interesting combinatorially defined classes of languages form varieties. 
But there are other combinatorially defined classes of languages that do not 
form a variety. Of particular interest are families of languages that are not 
necessarily closed under complementation but satisfy the other properties of 
a variety. We call such families "positive varieties of languages" . Is it pos- 
sible to get a similar algebraic characterization for these languages as well? 
In particular, is there a result similar to Eilenberg's variety theorem that 
permits us to treat positive varieties? 

Fortunately the answers to the above questions are "yes". The idea is 
to attach an order on the monoids and adapt the definition of recognition 



45 



CHAPTER 3. Algebraic Automata Theory 



by monoids to ordered monoids. This point of view is a generalization of 
the unordered case and allows us to make a one to one correspondence be- 
tween varieties of ordered monoids and positive varieties of languages. This 
extension was introduced in [Pin95j . 

Intuitively speaking, the syntactic monoid has less information than the 
minimal automaton. One reason for this is that in the minimal automa- 
ton the accepting states are predetermined, but in the syntactic monoid the 
accepting set is not. As we see in the next subsection, the order on the 
monoid restricts the way we can choose the accepting set and consequently 
the ordered syntactic monoid recovers some of the missing information. This 
restriction lets us analyze classes of languages that are not closed under com- 
plementation. 

In this section, we go over the definitions and the results seen thus far, 
and present the analogous ordered counterparts. We start with the notion 
of recognition by ordered monoids. Then we define the syntactic ordered 
monoid. Later we look at varieties of ordered monoids, positive varieties of 
languages and the variety theorem that establishes a one to one correspon- 
dence between these ordered monoid varieties and positive language varieties. 

3.2.1 Recognition by Ordered Monoids 

An order relation on a set S is a relation that is reflexive, anti-symmetric and 
transitive and it is denoted by <. We say that < is a stable order relation 
on a monoid M if for all x,y,z G M, x < y implies zx < zy and xz < yz. 

An ordered monoid (M, <m) is a monoid M together with a stable order 
relation <m that is defined on M. A morphism of ordered monoids $ : 
(M, <m) (N, <n) is a morphism between M and N that also preserves 
the order relation, i.e. for all m,m' G M, m <m m' implies $(m) <n $(m'). 

The free monoid £* will always be equipped with the equality relation. 
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Observe that any morphism $ : S* — > M is also a morphism of ordered 
monoids $ : (£*, =) — > (M, < M ) for any stable order <m and vice versa. 

A subset / C M is called an order ideal if for any y <E I , x <m U implies 
x G /. Observe that every order ideal / in a finite monoid M has a generating 
set xi, Xfc such that I = (xi, Xk) '■— {y G M : 3xi with y <m X{}. 

Now the concept of recognizability is very similar to the unordered case. 
We say that a language L C S* is recognized by an ordered monoid (M, <m) 
if there exists a morphism of ordered monoids $ : (£*, =) — > (M, <m) and 
an order ideal /CM such that L = <£> _1 (J). Equivalently, L is recognized 
by (M, <m) if there exists a morphism $ : S* — > M and an order ideal 
I C. M such that L = Observe that this is a generalization of the 

unordered case in the sense that any monoid is an ordered monoid with the 
equality order (the trivial order) and any subset of the monoid is an order 
ideal with respect to equality. Also note that in the unordered case, if L is 
recognized by M, then so is the complement of L. In the ordered case, since 
we require the accepting set to be an order ideal, this statement is no longer 
true. This restriction on the accepting set allows the ordered monoid to keep 
more information about the automaton recognizing L. In this sense, one can 
think of the ordered case as a refinement of the unordered case. 

3.2.2 The Syntactic Ordered Monoid 

The definition of the syntactic congruence with respect to L is as exactly 
as before: x =l y if for all u, v G S* we have uxv G L iff uyv G L. Also 
the syntactic monoid is the quotient monoid M(L) = £*/ =£,. To be able 
to get a similar variety theorem for classes of languages not closed under 
complementation, we need to define a stable order on M(L) that allows us 
to obtain an ordered counterpart of the variety theorem. 

First, break up =l\ Let x y if for all u, v G £*, uyv G L uxv G L. 
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So x =l y if and only if x -<l y and y <l x. Now -<l induces a well-defined 
stable order <i on M(L) given by 

[%] <l [y] if an d on ly if x dtL y- 

It is straightforward to check that this is indeed a well-defined stable order. 
The ordered monoid (M(L), <l) is the syntactic ordered monoid of L. 

We say that an ordered monoid (N, <n) divides an ordered monoid 
(M, <m) if there exists a surjective morphism of ordered monoids from a 
submonoid^ of (M, <m) onto (N, <n)- 

Now we state and prove the analog of Proposition 13.31 We give the 
proof to demonstrate that slight modifications to the original proof suffices 
to obtain the proof for the ordered counterpart. 

Proposition 3.10. {M(L\<£) recognizes L and is the minimal ordered 
monoid with this property. 

Proof. Let $ : S* — > M(L) be the surjective canonical mapping: w i— > [w], 
i.e. $ is the syntactic morphism. We already know that the congruence 
classes are homogeneous so all we need to show is that the accepting set 
I is an order ideal, i.e. we need to show that if [y] 6 / and [x] <l [y], 
then [x] G I. Since [x] <l [y], we have x <l y and so for all u,v G E*, 
im/v G L =>■ wxf G L. In particular y <E L =^ x £ L. Since [y] G /, 
y E L and therefore x E L and so [sc] G / as required. 

Let (M, <m) be any monoid that recognizes L. So there exists a mor- 
phism \1> : S* — * M and an order ideal / CM such that L = Let 
$ be defined as above. To show (M(L), <l) divides (M, <m) w e find a sur- 
jective morphism of ordered monoids T from a submonoid of (M, <m) onto 
(M(L),< L ). 



6 A submonoid of (M, <jw) is a submonoid of M with the order being the restriction of 
<m to the submonoid. 
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We let N := *(M) so (N, < M ) is a submonoid of (M, < M ). Define T 
to be the same function as the one we defined in the proof of Proposition 
E02 so T is such that T : (N,< M ) -> (M(L),< L ), #0) h+ i.e. 
$ = T o As shown before, T is a well-defined surjective morphism. 
What remains to be shown is that T is a morphism of ordered monoids. 
For this, we need to show *(a) < M #(&) T(*(o)) <l T(*(6)), i.e. 

*(a) <m *(&) =}► $(a) < L $(6). 

Suppose the above is not true. So there exists a and 6 such that ^(a) <m 
ty(b) but $(a) This means that a b and therefore there exists 

ii,d6 S* such that ufef G L but -uat> ^ L. On the other hand, since <m is a 
stable order we have 

ubv G L implies that ^(ubv) G / and by above and the fact that / is an 
order ideal we must have that ^f(uav) G /. This is a contradiction since 
uav $l L. □ 

3.2.3 Varieties 

The definition of an ordered monoid variety is identical to the unordered case. 
We say that a family of ordered monoids V is a variety of ordered monoids 
if it is closed under division of ordered monoids and finite direct product^. 

Similar to unordered monoid varieties, varieties of ordered monoids can 
be defined using identities. We say that (M, <m) satisfies the identity u < v 
if and only if for every morphism (f : S* — > M we have (f(u) <m V 9 ( t ')- Let 
V(w,t>) be the variety of ordered monoids that satisfy the identity u < v. 
Then given a sequence of pair of words (w„, f„)n>o> W := lim V(-u„, t> n ) is 
said to be ultimately defined by this sequence. 

7 The order in a finite direct product M\ x ... xM n is given by (toi, m n ) < (m'i, m' n ) 
iff mi < m! i Vi G [n]. 
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Theorem 3.11 ( [PW96] ). Every variety of ordered monoids is ultimately 
defined by some sequence of identities. 

Now we define positive variety of languages. A set of languages in E* that 
is closed under finite intersection and finite union is called a positive boolean 
algebra. So it differs from a boolean algebra because we do not require the 
set to be closed under complementation. A class of languages V is called a 
positive variety of languages if it is a positive boolean algebra, is closed under 
inverse morphisms and is closed under left and right quotients. 

For a given variety of finite ordered monoids V, let V(S) be the set of 
languages over £ whose syntactic ordered monoid belongs to V. As before, 
this is equivalent to saying that V(E) is the set of languages over £ that are 
recognized by an ordered monoid in V. 

Theorem 3.12 (The Variety Theorem [Pin95j ). V is a positive variety of 
languages and the mapping V i— > V is one to one. 

Now we give two explicit correspondences. The interested reader can find 
the proofs in |Pin95] . 

Let T be a subset of the alphabet S. Define L(T) as 



aer 

This is equivalent to saying that L(T) is the set of words that contain at least 
one occurrence of each letter in T. 

A monoid is idempotent if every element in the monoid is idempotent. 

Theorem 3.13. A language in S* is a finite union of languages of the form 
L(r) for r C S if and only if it is recognized by a finite commutative idem- 
potent ordered monoid (M, <m) i n which the identity is the greatest element 
with respect to the order. 
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A language L is a shuffle ideal if it satisfies the following property: if a 
word w has a subword in L, then w is in L. 

Theorem 3.14. A language is a shuffle ideal if and only if it is recognized 
by a finite ordered monoid in which the identity is the greatest element. 

We conclude this chapter by pointing out that our main interest is in pos- 
itive varieties of languages (and consequently in ordered monoids) because 
regular languages having 0(f) non-deterministic communication complexity 
form a positive variety of languages (see next chapter). For the communica- 
tion models studied in [TT03] . regular languages having 0(f) communication 
complexity form a variety of languages and so the theory of ordered monoids 
is not necessary. From now on, we abandon unordered monoids and work 
with the more general theory of ordered monoids. 



Chapter 4 

Communication Complexity of 
Regular Languages 



The main goal of this chapter is to prove upper and lower bounds on the non- 
deterministic communication complexity of regular languages. In Section 4.1, 
we formally define the communication complexity of finite ordered monoids 
and regular languages. We prove two theorems that establish the soundness 
of an algebraic approach to the communication complexity of regular lan- 
guages. In Section 4.2, we present a form of the definition of rectangular 
reductions and introduce local rectangular reductions. Then we present up- 
per and lower bound results for regular languages in which the lower bounds 
are established using rectangular reductions from three functions we have 
seen in Chapter 2. We also state an intriguing conjecture that gives an ex- 
act characterization of the non-deterministic communication complexity of 
regular languages. 
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4.1 Algebraic Approach to Communication 
Complexity 

In Chapter 2, we studied the communication complexity of functions that 
have 2 explicit inputs, each being an n-bit string. In order to define the 
communication complexity of a monoid and a regular language, we need to 
generalize the definition of communication complexity to include functions 
that have a single input string. Suppose a function / has one n-bit string 
input and let A U B be a partition of [n]. Then the communication 
complexity of / with respect to this partition is the communication complex- 
ity of / when Alice receives the bits x« for all i G A and Bob receives the bits 
Xj for all j G B. For instance, in the non-deterministic model we denote this 
by N\ B (f). In this case, the non-deterministic communication complexity of 
/ is defined as 

N\f) :=maxiVA B (/) 

where the maximum is taken over all possible partitions of [n]. The partition 
that achieves this maximum is called a worst case partition. 

Note that the communication complexity definitions and results seen thus 
far apply to functions that have inputs that are strings of length n over any 
fixed alphabet. That is, the requirement of bit strings as inputs can be 
relaxed. 

We define the communication complexity of a finite ordered monoid using 
the worst-case partitioning notion. The communication complexity of a pair 
(M, I) where M is a finite ordered monoid and I is an order ideal in M is the 
communication complexity of the monoid evaluation problem corresponding 
to M and I: Alice is given mi, m 3 , m 2 „-i and Bob is given m.2, m.4, m 2ri 
such that each m, G M. They want to decide if the product mim 2 ...m 2n is 
in I. The communication complexity of M is the maximum communication 
complexity of (M, I) where I ranges over all order ideals in M. Observe that 
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if for example Alice were to receive m 8 and m i+ i, then she could multiply 
these monoid elements and treat them as one monoid element. This is why 
for a worst-case partition, Alice and Bob should not get consecutive monoid 
elements. 

Similarly, we define the communication complexity of a regular language 
L C S* as the communication complexity of the language problem corre- 
sponding to L: Alice is given a±, 0,3, a2 ra _i and Bob is given a2, 04, a2 n 
such that each a, G SU {e} where e represents the empty word in S* (also 
referred to as the empty letter). They want to determine if a\a>2---(i2n G L. 
The way the input is distributed corresponds to the worst-case partition since 
we allow di to be empty letters. 

As mentioned in Chapter 1, our aim is to find functions fi(n), ...,/& (n) 
such that each regular language has Q(fi(n)) non-deterministic communi- 
cation complexity for some i G {1,2,..., fc}. We would also like a charac- 
terization of the languages with 0(/j(n)) complexity for all % G {1, 2, k}. 
The next two results show that such a characterization can be obtained by 
looking at the algebraic properties of regular languages. 

Theorem 4.1. Let L C S* be a regular language with M(L) = M. We have 
ATi(M) = e(A^(L)). 

Theorem 4.2. For any increasing function f : N — > N, the class of ordered 
monoids V such that each monoid Me V satisfies A^M) = 0(f) forms a 
variety of ordered monoids. 

These two theorems together with the variety theorem imply that the class 
of languages V such that for any L G V we have A X (L) = 0(f) forms a pos- 
itive variety of languages. So a characterization in terms of positive varieties 
is possible. Furthermore, observe that communication complexity of monoids 
parametrize the communication complexity of regular languages. Bounds on 
monoids yield bounds on regular languages and vice versa. When proving 
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such bounds, carefully choosing between the two directions can considerably 
simplify the analysis. Usually we find that upper bound arguments are easier 
to establish with the combinatorial descriptions of a language whereas lower 
bound arguments are easier to establish with the algebraic descriptions of 
the corresponding syntactic monoid. 

Proof of Theorem J^.l, First we show that N X {V) = 0(N 1 (M)). For this, we 
present a non-deterministic protocol for L. Suppose Alice is given aia 2 -..a n 
and Bob is given b^b<i ...b n . Let $ be the syntactic morphism and let / 
be the accepting order ideal. The protocol is as follows: Alice computes 
$(ai), $(a n ) and Bob computes $(&i), $(&„). Using the protocol for 
the monoid evaluation problem of (M, /), they can decide at 0(N 1 (M)) cost 
if 

$(ai)$(&i)...$(a n )$(6 n ) = $(a 1 b 1 ...a n b n ) G /. 

This determines if aibi...a n b n is in L or not. 

Now we show that N l (M) = OiN 1 ^)). We present a protocol for (M, /) 
where I = (ii, ik) is some order ideal in M. Before presenting the protocol, 
we first fix some notation and definitions. Again let $ be the syntactic 
morphism. For each monoid element m, fix a word that is in the preimage of 
m under and denote it by w m . Let Y a := {(u,v) : uav G L}. Recall that 
a b if for all u, v G £*, ubv G L =^ uav G L. So 

$(a) < L $(6) iffo±6iffncy„. 

For each Y a and Y b with Y b <^ Y a , pick (u,v) such that (u,v) G Y b but 
(u, v) ^ Y a . Let K be the set of all these (u, v). One can think of K as 
containing a witness for Y b ^ Y a for each such pair. Note that K is finite. 
Now pad each w m and each word appearing in a pair in K with the empty 
letter e so that each of these words have the same length. Observe that this 
length is a constant that does not depend on the length of the input that 
Alice and Bob will receive. 
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Now assuming that Alice and Bob have agreed upon the definitions made 
thus far, the protocol is as follows. Suppose Alice is given mf, m^, 
and Bob is given m^, m|, n$ n . For each ij they want to determine if 
m\rn\...m a n rr^ n <l ij- This is equivalent to determining if w m a m 6... m a m b 
Wy, and this is equivalent to w m aw m b...w m aw m b <l Wi r If this is not the 
case, then Y w . <£Y WaW , w aW . and so there will be a witness of this in 
K, i.e. there exists (u,v) such that uw i;j v G L but uw m aw m b ...w m °w m bV L. 
If indeed w m aw m b...w m aw m b then for each (u,v) G K with uw^v G L, 

we will have uw m aW m b...w m °w m bV G L. Using the protocol for L, Alice and 
Bob can check which of the two cases is true. The following shows how Alice 
and Bob's inputs look like before running the protocol for L. Note that each 
block has the same constant length. 



u 




ee...e 




W m a 


ee...e 


V 




6S...6 


€€...€ 






€€...€ 




€€...€ 



□ 

The proof of Theorem 14.21 follows from the following two lemmas. The 
first lemma shows that V is closed under finite direct product. The second 
lemma shows that V is closed under division of monoids. 

Lemma 4.3. Let (M, <m) and (N, <n) be ordered monoids. Then A" 1 (M x 
N) < N 1 (M)+N 1 (N). 

Proof. Any order ideal in M x N will be of the form I x J where I is an 
order ideal in M and J is an order ideal in N. Therefore testing whether a 
product of elements in M x iV is in an order ideal I x J or not can be done 
by testing if the product of the first coordinate elements is in I and testing 
if the product of the second coordinate elements is in J. □ 
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Lemma 4.4. Let (M, <m) and (N, <n) be ordered monoids such that N -< 
M. Then N X {N) < N^M). 

Proof. Since N -< M, there is a surjective morphism from a submonoid M' 
of M onto N. Denote by a fixed element from the preimage of n. 

Let I be an order ideal in N. A protocol for (N, I) is as follows. Alice 
is given n", n!^, and Bob is given n\, n\, n\. They want to decide if 
n\n\...n1n b t e /. This is equivalent to deciding if 

r\n\)r\n\)..4- 1 {n a t )<t>-\n b t ) e ^(Z). 

It can be easily seen that 4>~ 1 {I) is an order ideal in M' so Alice and Bob 
can use the protocol for M' to decide if the above is true. Therefore we have 
N l (N) < N l (M'). It is straightforward to check that N l (M') < N l (M) and 
so TV 1 (TV) < N\M) as required. □ 

4.2 Complexity Bounds for Regular Languages 
and Monoids 

In this section, we present upper and lower bounds for the non-deterministic 
communication complexity of certain classes of languages. Upper bounds 
are established by presenting an appropriate protocol whereas lower bound 
arguments are based on rectangular reductions from the following functions: 
LESS-THAN, PROMISE-DISJOINTNESS, INNER-PRODUCT. In Chapter 
2, we have seen that each of these functions require linear communication in 
the non-deterministic model. We have also seen the definition of a rectangular 
reduction. We give here a form of this definition which specifically suits our 
needs in this section. 

Definition 4.5. Let / : {0,l} n x {0, l} n -> {0,1}, M a finite ordered 
monoid and I an order ideal in M. A rectangular reduction of length t 
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from / to (M,I) is a sequence of 2t functions a±, &2, 03, 02t-i, with 
at : {0, l} n — > M and &j : {0, l} n — > M and such that for every x,y G {0, l} n 
we have f(x,y) = 1 if and only if the product ai(x)&2(y)-"&2t(y) is i n ^- 

Such a reduction transforms an input (x, y) of the function / into a se- 
quence of 2t monoid elements mi,m,2, ...,m,2t where the odd-indexed are 
obtained as a function of x only and the even-indexed are a function of 

We write / <* (M, 7) to indicate that / has a rectangular reduction of 
length t to (M, 7). When t = O(n) we omit the superscript t. It should be 
clear that if / <* (M, 7) and / has communication complexity Q(g(n)), then 
(M, 7) has communication complexity f2(g(t _1 (n))). 

Most of the reductions we use here are special kinds of rectangular reduc- 
tions. We call these reductions local rectangular reductions. In a local rectan- 
gular reduction, Alice converts each bit Xi to a sequence of s monoid elements 
m° l5 ml 21 ■■■, m is by applying a fixed function a : {0, 1} — > M s . Similarly Bob 
converts each bit yi to a sequence of s monoid elements m\ Xl m\ 2l ...,m\ s by 
applying a fixed function b : {0, 1} — > A7 S . f(x,y) = 1 if and only if 

We often view the above product as a word over M. The reduction transforms 
an input (x, y) into a sequence of 2sn monoid elements. Let a(z)k denote the 
k th coordinate of the tuple a(z). We specify this kind of local transformation 
with a 2 x 2s matrix: 



a(0)i 


6(0)i 






a(0) s 


b(0) s 


o(l)i 








a(l) s 





It is convenient to see what happens for all possible values of Xi and y { and 
the following table shows the word that corresponds to these possibilities. 
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For simplicity let us assume s is even. 



*Ej Hi 


corresponding word 





a(0) 1 6(0) 1 ...a(0) 8 6(0) a 


1 


a(0)i6(l)ia(0) 2 6(l) 2 ...a(0) 8 6(l) a 


1 


a(l)i6(0)ia(l) 2 6(0) 2 ...a(l) s 6(l) a 


1 1 


a(l)i6(l)i...a(l),6(l) a 



Now we are ready to present the upper and lower bound results. 

Lemma 4.6. If M is commutative then N l (M) = 0(1). 

Proof. Let I be an order ideal in M. Suppose Alice is given to", ...,to" and 
Bob is given m\, ...,m b n . They want to decide if m\m b x ...m a n m b n G /. Since M is 
commutative, this is equivalent to determining if m^m^...m^m\m b 1 ...m b n G /. 
So Alice can privately compute the product m°...m° and send the result to 
to Bob. Observe that this requires a constant number of bits to be commu- 
nicated since the size of M is a constant. Bob can check if mm\...m b n G I 
and send the outcome to Alice. □ 

Lemma 4.7. If M is not commutative then for any order on M we have 
N\M) = fi(logn). 

Proof. Since M is not commutative, there must be a, b G M such that ab ^ 
ba. Therefore either ab ba or ba ab. Without loss of generality 
assume ba ^ M ab. Let I = (ab). We show that LT < 2 ™ (M,I). Alice gets 
x and constructs a sequence of 2 n monoid elements in which a is in position 
x and 1m is in everywhere else. Bob gets y and constructs a sequence of 2 n 
monoid elements in which b is in position y and 1m is in everywhere else. If 
x < y then the product of the monoid elements will be ab which is in /. If 
x > y then the product will be ba which is not in /. □ 

Denote by Com the positive language variety corresponding to the variety 
of commutative monoids Com. The above two results show that regular 
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languages that have constant non-deterministic communication complexity 
are exactly those languages in Com. 

The next step is to determine if there are regular languages outside of 
Com that have O(logn) non-deterministic complexity For this, we first need 
the definition of a polynomial closure. 

The polynomial closure of a set of languages £ in £* is a family of lan- 
guages such that each of these languages are finite unions of languages of the 
form 



where k > 0, a.i e £ and Li G C. If V is a variety of languages, then 
we denote by Pol{V) the class of languages such that for every alphabet E, 
Po/(V)(E) is the polynomial closure of V(E). PoliV) is a positive variety of 
languages ( |PW95j ). 

Lemma 4.8. If L is a language of Pol {Com) then N l (L) = O(logn). 

Proof. Suppose L is a union of t languages of the form L aiL 1 ...a/ ! .L; s . Alice 
and Bob know beforehand the value of t and the structure of each of these t 
languages. So a protocol for L is as follows. 

Assume Alice is given x1,...,x® and Bob is given x\,...,x b n . God com- 
municates to Alice and Bob which of the t languages the word x1x\...x^x^ 
resides in. This requires a constant number of bits to be communicated since 
t is a constant. Now that Alice and Bob know the Loa\Li...akLk the word 
is in, God communicates the positions of each a^. This requires fclogn bits 
of communication where A; is a constant. The validity of the information 
communicated by God can be immediately checked by Alice and Bob. All 
they have to do is check if the words in between the a^s belong to the right 
languages. Since these languages are in Com, this can be done in constant 
communication as proved in Lemma 14.61 Therefore in total we require only 
0(\ogn) communication. □ 




L aiLi...akLk 
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From the above proof, we see that we can actually afford to communicate 
O(logn) bits to check that the words between the a^'s belong to the corre- 
sponding language. In other words, we could have Lj 6 Pol{Com). Note 
that this does not matter since Pol(Pol(Com)) = Pol(Com). 

Denote by ( r T x T ) the number of factorizations of the word x as 
x = WoaiWi...a,kWk with Wi G Lj. When the <2j and the Lj are such that 
for any x we have ( LoaiL * ak L k ) ^ {^j 1}j then we say that the concate- 
nation LoaiL\...akLk is unambiguous. We denote by UPol(V) the vari- 
ety of languages that is disjoint unions of the unambiguous concatenations 
L aiLi...a kr L) s . with Li e V (in some sense, there is only one witness for x in 
UPoliV)). Similarly we denote by M p Pol(V) the language variety generated 
by the languages 

{x\( X ) = j modp} 

\LQa\Li...akL,kJ 

for some < j < p — 1 and Li e V. Observe that for Pol(Com) we have 
( r T x T ) unrestricted. 

Denote by UP the subclass of NP in which the number of accepting paths 
(or number of witnesses) is exactly one. We know that UP CC = P cc ( |Yan91j ). 
From [TT03j we know that regular languages having O(logn) deterministic 
communication complexity are exactly those languages in UPol(Com) and 
regular languages having O(logn) Mod p counting communication complexity 
are exactly those languages in M p Pol(Com). Furthermore, it was shown 
that any regular language outside of UPol(Com) has linear deterministic 
complexity and any regular language outside of M p Pol(Com) has linear Mod p 
counting complexity. So with respect to regular languages, UP CC = P cc = 
UPol(Com) and Mod p P cc = M p Pol(Com). Similarly we conjecture that with 
respect to regular languages NP CC = Pol{Com). 

Conjecture 4.9. //LCE* is a regular language that is not in Pol(Com), 
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then iV^L) = Q(n). Thus we have 

(0(1) if and only if L E Com; 

O(logn) if and only if L E Pol(Com) but not in Com; 
0(n) otherwise. 

As mentioned in Chapter 2, the gap between deterministic and non- 
deterministic communication complexity of a function can be exponentially 
large. However, it has been shown that the deterministic communication 
complexity of a function / is bounded above by the product cA r °(/)A^ 1 (/) 
for a constant c (Theorem I2.9p . and that this bound is optimal. The above 
conjecture, together with the result of |TT03] implies the following much 
tighter relation for regular languages. 

Corollary 4.10 (to Conjecture 14. 9[) . If L is a regular language then D(L) = 
maxjA^LXA^L)}. 

For any variety V, we have that Pol(V) fl co — Pol(V) = UPol(V) 
([Pm97j). This implies that N^L) = O(logn) and N°(L) = O(logra) iff 
D(L) = O(logn), proving a special case of the above corollary. 

An important question that arises in this context is the following. What 
does it mean to be outside of Pol(Com)l In order to prove a linear lower 
bound for the regular languages outside of Pol(Com), we need a convenient 
algebraic description for the syntactic monoids of these languages since (ig- 
noring the exceptions) lower bound arguments rely on these algebraic prop- 
erties. One such description exists based on a result of |PW95j that describes 
the ordered monoid variety corresponding to Pol{Com). Before stating this 
description, we fix some notation. 

If M is a monoid, we write M = (G, R) to indicate that M has the 
presentation (G, R) where G is the generating set and R is the set of relations. 
For instance, a cyclic group of order n has the presentation ({x},x n = 1) 
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and the dihedral group of order 2n has the presentation {{x, y}, x n = 1, y 2 = 
l,xyx = y). For any w G G*, we denote by eval(w) the element of M that 
w corresponds to. Observe that the transformation monoid corresponding to 
an automaton has a presentation in which the generating set consists of the 
letters of the alphabet. The relations depend on the particular automaton 
and can be determined by analyzing the state transition function each word 
induces. 

Lemma 4.11. Suppose L is not in Pol (Corn) and M = (G, R) is the syntac- 
tic ordered monoid of L with exponent uj . Then there exists u,v G G* such 
that 

(i) for any monoid M' G Com and any morphism (f> : M — > M' , we have 
4>(eval(u)) = (f)(eval(v)) and 4>(eval(u)) = <f)(ev al(u 2 )), 

(ii) eval(u u vu u ) ^ eval(u"). 

Although we cannot yet prove the conjecture, we can still show linear 
lower bounds for certain classes of regular languages outside of Pol (Com). 
Our first lower bound captures regular languages that come very close to the 
description given in the previous lemma. 

A word w is a shuffle of n words wi, w n if 

W = W ljl W 2 ,l...W njl W2,lW2,2---W ni 2W ljk W 2 ,k---Wn,k 

with k > and u^iu^.-.u^fc = Wi is a partition of Wi into subwords for 
1 < % < n. 

Lemma 4.12. If M — (G,R) and u,v G G* is such that 

(i) u = WiW 2 for Wi,w 2 G G* , 

(ii) v is a shuffle of w\ and W2, 
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(iii) eval(u) is an idempotent, 

(iv) eval(uvu) ^ eval(u), 

then N^M) = Q(n). 

Observe that the conditions of this lemma imply the conditions of Lemma 
14.111 since eval(u) is idempotent, for any monoid M' G Com and any 
morphism <fr : M — ► M', we have (j)(eval{uj) = <j)(eval(u 2 )) and since v is 
a shuffle of w\ and w 2 we have <j)(eval(u)) = (f>(eval(v)). Also, since eval(u) 
is idempotent, eval{u w ) = eval(u), and in this case eval(uvu) ^ eval(u) is 
equivalent to evaliu^vv?) ^ eva/(u UJ ). 



Proof of Lemma \4-lS\ We show that PDISJ < r (M, /) where J = (eval(u)). 
Since f is a shuffle of w\ and u>2, there exists k > such that 

f = Wl,lW2,lWi )2 W2,2-Wi )fc K; 2iA . 

The reduction is essentially linear and is given by the following matrix when 
k = 3. The transformation easily generalizes to any k. 



Wi 


e 


e 


e 


e 


w 2 ,i 


e 


W 2t2 


e 


^2,3 




U! 2 ,l 


w 1>2 


W 2 ,2 


Wl,3 


W 2 ,3 


e 


e 


e 


e 





Vi 


corresponding word 








^1^2,1^2,2^2,3 = U 





1 


^1^2,1^2,2^2,3 = U 


1 





Wi t lWi t2 Wi t 3W 2t lW 2t2 W 2t 3 = U 


1 


1 


Wl, 1^2,1^1,2^2,2^1,3^2, 3 = V 



After x and y have been transformed into words, Alice prepends her word 
with u and appends it with \u\ many e's, where \u\ denotes the length of 
the word u. Bob prepends his word with \u\ many e's and appends it 
with u. Let a(x) be the word Alice has and let b(y) be the word Bob 
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a 




a, b 



Figure 4.1: The minimal automaton recognizing the language whose syntactic 
ordered monoid is BA\. 

has after these transformations. Observe that if PDISJ(x, y) = 0, then 
there exists i such that Xi = yi = 1. By the transformation, this means 
that a(x)ib(x)ia(x)2b(x)2--.a{x) s b(x) s is of the form u...uvu...u and since 
eval(u) is idempotent, eval(a(x)ib(x)ia(x)2b(x)2---a(x) s b(x) s ) = eval(uvu) ^ 
eval(u). On the other hand if PDISJ(x, y) = 1, then by the transformation, 
a(x)ib(x) 1 a(x)2b(x)2---a(x) s b(x) s is of the form u...u and so 

eval(a(x)ib(x)ia(x)2b(x)2--.a(x) s b(x) s ) = eval(u). 

□ 

The above result gives us a corollary about the monoid BA\ which is 
defined to be the syntactic ordered monoid of the regular language recognized 
by the automaton in Figure H~T1 The unordered syntactic monoid of the same 
language is denoted by BA 2 and is known as the Brandt monoid (see |Pin97j ). 

Corollary 4.13. N X {BA%) = fi(n). 

Proof. It is easy to verify by looking at the transformation monoid of the 
automaton that BA\ = ({a, b},aa = bb, aab = aa, baa = aa, aaa = a, aba = 
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a, bab = b) . The only thing we need to know about the order relation is that 
eval(aa) is greater than any other element. This can be derived from the 
definition of the syntactic ordered monoid (Subsection 3.2.2) since for any 
Wi and u>2, Wiaaw2 is not in L. So W\aaw2 G L W1XW2 G L trivially 

holds for any word x. Let u = ab and v = ba. These u and v satisfy the four 
conditions of the previous lemma. The last condition is satisfied because 
eval{uvu) = eval(abbaab) = eval(aa) and eval(ab) ^ eval(aa). Therefore 
N 1 (BA£) = n(n). □ 



Denote by U~ the syntactic ordered monoid of the regular language (a U 
b)*aa(a U b)*, and denote by U the unordered syntactic monoid. Also let 
U + be the syntactic ordered monoid of the complement of (a U b)*aa(a U b)*. 
Observe that N 1 ^^) = O(logn) since all we need to do is check if there are 
two consecutive a's. By an argument similar to the one for Corollary I4.13[ 
one can show that N 1 (U + ) = Q(n). 

Our next linear lower bound result is for non-commutative groups. 

Lemma 4.14. If M is a non- commutative group then N l (M) = Q(n). 

Proof. Since M is non-commutative, there exists a, b G M such that the 
commutator [a, b] = a _1 6 _1 a6 ^ 1. This means that [a, b] has order q > 1. 
Let m G M be such that there is no m! G M with m! ^ m and m! < m. 
Denote by / the order ideal that just contains m. There is a reduction from 
IP q to (M,I). The reduction is essentially local. Alice and Bob will apply 
the transformation given by the following matrix. 



1 


1 


1 


1 


a- 1 


b- 1 


a 


b 
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-Ei Hi 


corresponding word 





1 


1 


b^b = 1 


1 


a~ x a = 1 


1 1 


a~ 1 b~ 1 ab 



After, Alice will append m to her transformed input and Bob will append 1 
to his. Observe that the product of the monoid elements evaluates to m if 
and only if Yli<i<n x 'iyi = m °d Q i- e - t ne product is < m if and only if 
J2i<i<n x iVi = mod 9- D 

To obtain our last linear lower bound result, we need the following fact. 

Proposition 4.15. Any stable order defined on a group G must be the trivial 
order (equality). 

Proof. Suppose the claim is false. So there exists a,b e G such that a ^ b 
and a < b. This implies 1 < a _1 6 =: g. If 1 < g then g < g 2 , g 2 < g 3 and so 
on. Therefore we have I < g < g 2 < ■■■ < g k = I. This can only be true if 
1 = g, i.e. a = b. □ 

We say that M is a T q monoid if there exists idempotents e, / e M such 
that (ef) q e = e but (ef) r e ^ e when g does not divide r. 

Lemma 4.16. If M is a T q monoid for q > 1 £/zen ^(M) = Q(n). 

Proof. Observe that {e, e/e, (e/) 2 e, (e/)' 3_1 e} forms a subgroup with iden- 
tity e because since e is idempotent, we have (ef) l e ■ (ef) j e = (ef) l+j e. 
Therefore any order on M must induce an equality order on this set. Let 
/ = (e). We show IP q < r (M, I) via the following local reduction. 



e(e/)« 


{efYe 


e 


fe 
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corresponding word 





e(e/)«(e/)*e = e 


1 


e(e/)«/e = e 


1 


e(ef) g e = e 


1 1 


efe 



Observe that the product of the monoid elements evaluates to 

( e f\El<i<n x iVi=0 mod,?^ 

which is equal to e if and only if IP q (x, y) — 1. □ 

Combining our linear lower bound results together with Lemma 14. 4[ we 
can conclude the following. 

Theorem 4.17. If M is a T q monoid for q > 1 or is divided by one of BA^ , 
U + or a non- commutative group, then A^ 1 (M) = Q(n). 

We underline the relevance of the above result by stating a theorem which 
we borrow from jTT05j. 

Theorem 4.18. If M is such that D(M) ^ O(logn) then M is either a T q 
monoid for some q > 1 or is divided by one of BAi, U or a non- commutative 
group. 

The three linear lower bound results imply the following result, which 
gives us three sufficient conditions for not being in Pol(Com). 

Theorem 4.19. Let L be a regular language with syntactic ordered monoid 
M = {G, R) . If one of the following holds, then L is not in Pol{Com). 

1. There exists u,v G G* such that u = W\W2, v is a shuffle ofui\ and W2, 
eval{u) is an idempotent and eval{uvu) ^ eval{u). 

2. M is divided by a non- commutative group. 
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3. M is a T q monoid for q > 1 . 

In particular, if M{L) is a T q monoid or is divided by one of BA^, U + or 
a non-commutative group, then L is not in Pol {Com). 



Chapter 5 



Conclusion 



The focus of this thesis has been the non-deterministic communication com- 
plexity of regular languages. Regular languages are, in some sense, the sim- 
plest languages with respect to the usual time/space complexity framework, 
but in the communication complexity model, they require a non-trivial study 
as there are complete regular languages for every level of the communication 
complexity polynomial hierarchy. This fact can be derived from the results in 
|Bar86j and |BT87] . In |TT03] , a complete characterization of the communi- 
cation complexity of regular languages was established in the deterministic, 
simultaneous, probabilistic, simultaneous probabilistic and Mo d p - counting 
models. In order to get a similar algebraic characterization for the non- 
deterministic model, one needs the notion of ordered monoids, a more general 
theory than the one used in [TT03J, to be able to deal with classes of lan- 
guages that are not closed under complementation. This thesis presents the 
fundamentals of communication complexity, monoid theory as well as ordered 
monoid theory and obtains bounds on the non-deterministic communication 
complexity of regular languages. 

Our results constitute the first steps towards a complete classification for 
the non-deterministic communication complexity of regular languages. We 
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know exactly which regular languages have constant non-deterministic com- 
munication complexity. We know that there is a considerable complexity gap 
between those languages having constant non-deterministic complexity and 
the rest of the regular languages since if a regular language does not have 
constant complexity than it has Q(logn) complexity We also obtain three 
linear lower bound results and the importance of these results are highlighted 
by Theorem 14.181 These results also provide us with several sufficient condi- 
tions for not being in the variety Pol(Com), which is a result very interesting 
from an algebraic automata theory point of view. This result also exemplifies 
how computational complexity can be used to make progress in semigroup 
theory. 

Our ultimate objective is to get a complete characterization of the non- 
deterministic communication complexity of regular languages. We conjecture 
that regular languages in Pol (Com) are the only languages that have O(logn) 
complexity and any other regular language must have Q(n) complexity. The 
linear lower bound argument presents a real challenge. A natural next step 
to take is to explicitly find a regular language that is not in Pol(Com) for 
which our current linear lower bound arguments do not apply and try to 
either prove a linear lower bound for this specific language or show that it 
requires 0(n e ) complexity for a constant e < 1 (which would disprove our 
conjecture). A linear lower bound argument for this language is likely to 
apply to some other languages outside of Pol(Com), if not all. The regular 
language recognized by the automaton in Figure EU] is an example of a regular 
language that is outside of Pol(Com) and for which we cannot get a linear 
lower bound nor a sublinear upper bound. We call this language L5. In the 
Appendix, we prove that L5 is not in Pol(Com) and various other facts about 
L 5 . 

An interesting property of L 5 is that it belongs to Po/(A/"z/g) where M1I2 
denotes the variety of languages that correspond to the variety of nilpotent 
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Figure 5.1: An automaton recognizing a language L 5 outside of Pol{Com). 

groups of class 2. Nilpotent groups of class 2 are usually considered as "al- 
most" commutative groups. In some sense, this says that even though L 5 is 
not in Pol(Com), it is very "close" to it. 

We propose several intuitive reasons of why proving a linear lower bound 
for this regular language can be challenging (assuming that the linear lower 
bound is indeed true). We also suggest possible approaches to overcome 
the difficulties. All of these tie with the importance of the problem we are 
studying from a communication complexity point of view as well as from a 
semigroup theory point of view. 

First of all, from Chapter 2 we know that the best lower bound technique 
we have for non-determinism is the rectangle size method. Inherent in this 
method is the requirement to find the best possible distribution. Needless 
to say, this can be quite hard. And even if the best distribution was known, 
bounding the size of any 1-monochromatic rectangle can be a non-trivial 
task. Putting these two things together, the rectangle size method does not 
seem to considerably simplify our task of bounding the size of the optimum 
covering of the 1-inputs. 
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Consider the set S of all functions having Q(n) non-deterministic com- 
munication complexity. Define an equivalence relation on these functions: 
/ = g if there is a rectangular reduction of length 0(n) from f to g and from 
g to f. We can turn S/ = into a partially ordered set (poset) by defining 
the order [/] < [g] if there is a rectangular reduction of length 0(n) from 
/ to g. It certainly would not be surprising if there were regular languages 
appearing in the lower levels of a chain in this poset and this would suggest 
that obtaining a lower bound for these languages can be difficult. 

If the above is indeed true, then what can be done about this? A natural 
step would be to find functions that are at the same level or below the regular 
language at hand, and try to get a reduction that would prove the language 
has linear non-deterministic complexity. This raises our interest in promise 
functions. 

Let / be a boolean function with the domain {0, l} n x {0, l} n . A promise 
function Pf is a function that has a domain D that is a strict subset of /'s 
domain and is such that for any (x,y) G D, Pf(x,y) = f(x,y). An example 
of a promise function is the PROMISE-DISJOINTNESS function, PDISJ. 
Promise functions are interesting because through a promise, we can define 
functions that reside in the lower levels of a chain. This in return can make a 
reduction possible from the promise function to the regular language of inter- 
est. For instance, PDISJ is a promise function which lies below DISJ and 
IP q (Example 12.51) . Of course an important point when defining a promise 
function is that we need the promise function to have Q(n) complexity. In 
some sense, through the promise, we would like to eliminate the easy in- 
stances and keep the instances that make the function hard. At first, there 
might be no reason to believe that obtaining a linear lower bound for the 
promise function is any easier than obtaining a lower bound for the regular 
language. Nevertheless, the purpose of this line of attack is the following. 
By putting a promise on a well-known, well-studied function (that makes a 
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reduction possible), we may be able to utilize (or improve) the various tech- 
niques and ideas developed for the analysis of the original function to prove 
a lower bound on the promise function. 

Now we define a promise function, PROMISE-INNER-PRODUCT (PIP 2 ), 
such that there is a reduction from this function to L5 (see Appendix) . PIP 2 
is the same function as IP2 but has a restriction on the (x, y) for which 
IP 2 (x,y) = 0. We only allow the 0-inputs which satisfy the following two 
conditions: 

Vi, Xi = and y { = 1 ==>> IP 2 (x 1 ...x i _ 1 ,y 1 ...y i ^ 1 ) = 

Vi, ^ = 1 and y< = =>• IP 2 (x 1 ...x i _ 1 ,y 1 ...y i ^ 1 ) = 1 

It remains an open problem to prove a linear lower bound, or a sublinear 
upper bound on PIP 2 . 

The fact is that little is known about promise functions. One promise 
function we know of is PDISJ. As a consequence of the celebrated work of 
Razborov ( |Raz92j ) . which shows that the distributional communication com- 
plexity of the DISJOINTNESS function is fi(n), we know that N 1 (PDISJ) = 
Q(n) as well. Given the description of regular languages outside of Pol(Com) 
(Lemma 14.111) . PDISJ is one of the first functions one tries to get a reduc- 
tion from, where the reduction is as in the proof of Lemma 14.121 This hope 
is hurt by the fact that such a reduction does not exist from PDISJ to L 5 
(see Appendix). 

We believe that more attention should be given to promise functions since 
the study of these functions is likely to force us to develop new techniques 
in communication complexity and give us more insight in this area. Further- 
more, given the connection of communication complexity with many other 
areas in computer science, promise functions are bound to have useful appli- 
cations. For instance, in a very recent work of Gal and Gopalan QGG07J), 
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communication complexity bounds for a promise function is used to prove 
bounds on streaming algorithms. 

We have looked at our question from a communication complexity per- 
spective. Now we look at it from a semigroup theory perspective. The key to 
making progress on our question can be finding a more convenient description 
of what it means to be outside of Pol(Com). The description that we have 
(Lemma l4.1ip actually applies to Pol{V) for any variety V, and it is based on 
a complicated result of [PW95J that makes use of a deep combinatorial result 
of semigroup theory ( [Sim89] , |Sim90] . [Sim92j ) . Since we are only interested 
in Pol(Com) where Com is a relatively simple variety, it may be possible 
to obtain a more useful description that allows us to show communication 
complexity bounds. 

We conclude that, in any case, the resolution of our question will probably 
lead to advances in either communication complexity or semigroup theory, if 
not both. 



Appendix A 
Facts About L5 




In this appendix, we prove some of the facts about the regular language 
L 5 that we mentioned in Chapter 5. We start with the fact that L 5 is not 
in Pol(Com). For this, we need a result that describes the ordered monoid 
variety corresponding to Pol(Com). This description involves the Mal'cev 
product and topological issues which we choose to avoid for simplicity. The 
interested reader can find the necessary information about these in [Pin97j. 
Here we will state a restricted version of this result which suffices for our 
needs. 

Lemma A.l. Let L be a language in Pol(Com) and let M = (G,R) be the 
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syntactic ordered monoid of L with exponent u. Then for any u,v E G* with 
the property that any monoid M' G Com and any morphism <fi : M — > M' 
satisfies both (f>(eval(u)) = <f>(eval(v)) and <f>(eval(u)) = (f)(eval(u 2 )) , we must 
have eval{u U} vu UJ ) < evaliy!^). 

Proposition A. 2. L 5 is not in Pol (Com). 

Proof. Consider the transformation monoid of L 5 , which is the syntactic 
monoid. Let u = abab and v = bbaa. Observe that eval(u) is an idempotent 
and this u and v satisfy the condition in the lemma. We show eval(uvu) ^ 
eval(u). Observe that eval(uvu) = eval(v) so we want to show eval(v) ^ 
eval(u). If the opposite was true, then by the definition of the syntactic 
ordered monoid (Subsection 3.2.2), we must have for any ui\ and w 2 , W\uw 2 G 
L => W1VW2 G L. In particular, for W\ — e and w 2 = aa, we would have 
uaa G L =^ vaa G L. It is true that uaa G L but vaa L. □ 

Now we show that the PROMISE-INNER-PRODUCT function that we 
defined in Chapter 5 reduces to L 5 . 

Proposition A.3. PIP 2 < L 5 

Proof. The reduction is linear and is given by the following matrix. 



a 


e 


e 


b 


a 


b 


e 


e 


a 


b 


e 


a 


b 


a 


e 


b 



%i Hi 





1 

1 

1 1 



corresponding word 



abab 
abaaab 

abbb 
ababab 



state transition 



1, 2 
1, 2 
5, 2 
3, 2 



5, 3 
5, 3 
5, 3 
5, 3 



3, 4 
5, 4 
3, 4 
1, 4 



5, 5 
5, 5 
5, 5 
5, 5 



After this transformation is applied, Alice appends her word with b and Bob 
appends his word with e. Now if PIP 2 (x,y) = 1 then the transformed word 
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must end up at state 5. This is because, from state 1, we either enter state 
5 and stay there forever, or we go to state 3 when x^ — yi — 1. If we were in 
state 3 already, Xi — yi — 1 takes us back to state 1. PlPiix, y) = 1 implies 
that there are an odd number of indices for which x± — y^ — 1. So after the 
linear transformation, assuming we do not end up in state 5, we would end 
up in state 3. The b appended at the end of the transformed word would 
ensure that we end up in state 5. If PIP 2 (x, y) = 0, then we do not want to 
end up at state 5. Observe that the promise ensures we never enter state 5 
and since there are even number of indices for which Xi — yi — 1, we must 
end up at state 1. The b appended at the end of the word just takes us from 
state 1 to 2. □ 



Observe that we can restrict the 1-inputs of PIP2 the same way we re- 
stricted the 0-inputs and the reduction would trivially work for this case as 
well. Putting a promise on both the 0-inputs and the 1-inputs may help 
analyzing the complexity of PIP<i- 

Proposition A. 4. There is no local reduction from PDISJ to L 5 such that 
the reduction is of the form 





Vi 


corresponding word 








u w 





1 


u u 


1 







1 


1 


V 



where u and v satisfy the conditions of Lemma 1^.11 



Proof. (Sketch). Since is an idempotent, it must induce a state transition 
function in which either 

1. k — > k, j — » j and other states are sent to 5, or 

2. k — > k and other states are sent to 5, or 
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3. every state is sent to 5. 

Observe that it cannot be the case that u u is a partial identity on more than 
two states. If u u satisfies condition 2, then we cannot have eval(u u 'vu u ') ^ 
evaliu"). Suppose this is the case. Then there exists Wi,w 2 such that 
wiu u} w 2 e L and wiu 0J vu u 'w2 L. Since the latter is true, it must be the 
case that w\ takes state 1 to A; and u>2 must take Ho a state other than 5. 
These W\ and do not satisfy WiU w W2 € L, so we get a contradiction. This 
shows we cannot have condition 2. Similarly, one can show that cannot 
satisfy condition 3, which leaves us with condition 1. This means u w is either 
(abab) k or (baba) k for some k > 0. We assume it is (abab) k . The argument 
for (baba) k is very similar. 

Given u w = (abab) k , and the fact that we want to satisfy eval(u u 'vu u ') ^ 
evaliu^), one can show that the state transition function induced by v must 
be one of the following. 
1.1 — > 3 and any other state is sent to 5. 

2. 3 — > 1 and any other state is sent to 5. 

3. 1 — > 3 and 3 — >• 1 and any other state is sent to 5. 

Suppose v satisfies condition 1. 
Case 1: v — (ab) 2t ~ l for t > 0. Consider the matrix representation of the local 
reduction. In this matrix A, we count the parity of the a's in two ways and 
get a contradiction. First we count it by looking at the rows. The first row 
must produce the word u u = (abab) k and the second row must produce the 
word v = (ab) 21 " 1 so in total we have odd number of a's. Now we count the 
parity of a's by looking at ^1^2,2^1, 3^2,4--- and ^2,1^1,2^2,3^4.... Both 
of these must produce the word (abab) k so in total we must have an even 
number of a's. 

Case 2: v — (ab) 2t bb... for t > 0. Let c be the column where we find the 
second b in the second row. Give value 1 to entries of A which are a and give 
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value -1 to entries of b. Other entries (the e's) get value 0. In terms of these 
values we have 

c 
i=l 

and 

c 

5>i,*e{<U}- 

Adding the two sums, we get a negative value. Now we count the same total 
in a different order. Assuming c is even we have 

c/2 c/2 

i=i i=i 

and 

c/2 c/2 

i=l i=l 

The total is positive. This is a contradiction. 

Case 3: v — (a6) 2 *~ 1 a(aa) 2 *6... for t > 0. Similar argument as above. 

Same ideas show that i> cannot satisfy neither conditions 2 nor 3. □ 



Bibliography 



[Bar86] D. A. Barrington. Bounded- width polynomial-size branching pro- 
grams recognize exactly those languages in NCI. In STOC '86: 
Proceedings of the eighteenth annual ACM symposium on Theory 
of computing, pages 1-5, New York, NY, USA, 1986. ACM Press. 

L. Babai, P. Frankl, and J. Simon. Complexity classes in commu- 
nication complexity theory (preliminary version). In FOCS 7 86: 
Proceedings of the 27th Annual IEEE Symposium on Foundations 
of Computer Science, pages 337-347, 1986. 

[BNS92] L. Babai, N. Nisan, and M. Szegedy. Multiparty protocols, pseu- 
dorandom generators for logspace, and time-space trade-offs. J. 
Comput. Syst. Set., 45(2):204-232, 1992. 

[BPS07] P. Beame, T. Pitassi, and N. Segerlind. Lower bounds for lovasz- 
schrijver systems and beyond follow from multiparty communica- 
tion complexity. SI AM Journal on Computing, 37(3):845-869, 2007. 

[BT87] D. Barrington and D. Therien. Finite monoids and the fine structure 
of NCI. In STOC '87: Proceedings of the nineteenth annual ACM 
conference on Theory of computing, pages 101-109, New York, NY, 
USA, 1987. ACM Press. 



80 



81 



BIBLIOGRAPHY 



[CFL83] A. K. Chandra, M. L. Furst, and R. J. Lipton. Multi-party pro- 
tocols. In STOC '83: Proceedings of the fifteenth annual ACM 
symposium on Theory of computing, pages 94-99, New York, NY, 
USA, 1983. ACM Press. 

[Eil74] S. Eilenberg. Automata, Languages, and Machines. Academic 
Press, Inc., Orlando, FL, USA, 1974. 

[Fur87] M. Furer. The power of randomness for communication complexity. 

In STOC '87: Proceedings of the nineteenth annual ACM conference 
on Theory of computing, pages 178-181, New York, NY, USA, 1987. 
ACM Press. 

[GG07] A. Gal and P. Gopalan. Lower bounds on streaming algorithms 
for approximating the length of the longest increasing subsequence. 
In FOCS '07: Proceedings of the 48th Annual IEEE Symposium 
on Foundations of Computer Science, pages 294-304, Washington, 
DC, USA, 2007. IEEE Computer Society. 

[Gro92] V. Grolmusz. Separating the communication complexities of MOD 
m and MOD p circuits. In IEEE Symposium on Foundations of 
Computer Science, pages 278-287, 1992. 

[Gro98] V. Grolmusz. Circuits and multi-party protocols. Computational 
Complexity, 7(1):1-18, 1998. 

[HG91] J. Hastad and M. Goldmann. On the power of small-depth threshold 
circuits. Computational Complexity, 1:113-129, 1991. 

[KN97] E. Kushilevitz and N. Nisan. Communication Complexity. Cam- 
bridge University Press, 1997. 

[MP71] R. McNaughton and S. Papert. Counter-free Automata. MIT Press, 
Cambridge, MA, USA, 1971. 



BIBLIOGRAPHY 



82 



[New91] I. Newman. Private vs. common random bits in communication 
complexity. Inf. Process. Lett, 39(2):67-71, 1991. 

[Nis93] N. Nisan. The communication complexity of threshold gates, 1993. 

[Pin86] J.-E. Pin. Varieties of formal languages. North Oxford, London- 
Plenum, New- York, 1986. (Traduction de Varietes de langages 
formels) . 

[Pin95] J.-E. Pin. A variety theorem without complementation. Russian 
Mathematics (Izvestija vuzov.Matematika), 39:80-90, 1995. 

[Pin97] J.-E. Pin. Syntactic semigroups. In G. Rozenberg and A. Salomaa, 
editors, Handbook of formal languages, volume 1, chapter 10, pages 
679-746. Springer, 1997. 

[PW95] J.-E. Pin and P. Weil. Polynomial closure and unambiguous prod- 
uct. In ICALP 7 95: Proceedings of the 22nd International Collo- 
quium on Automata, Languages and Programming, pages 348-359, 
London, UK, 1995. Springer- Verlag. 

[PW96] J.-E. Pin and P. Weil. A Reiterman theorem for pseudovarieties of 
finite first-order structures. Algebra Universalis, 35:577-595, 1996. 

[Raz92] A. A. Razborov. On the distributional complexity of disjointness. 
Theor. Comput. Set., 106(2) :385-390, 1992. 

[Raz04] R. Raz. Circuit complexity and communication complexity, vol- 
ume 10 of IAS/Park City Mathematical Series, pages 159-197. 
American Mathematical Society, 2004. 

[RM97] R. Raz and P. McKenzie. Separation of the monotone NC hier- 
archy. In FOCS 7 91: Proceedings of the 38th Annual Symposium 



83 



BIBLIOGRAPHY 



on Foundations of Computer Science, pages 234-243, Washington, 
DC, USA, 1997. IEEE Computer Society. 

[Sch65] M. P. Schiitzenberger. On finite monoids having only trivial sub- 
groups. Information and Control, 8 (2): 190-194, 1965. 

[Sim75] I. Simon. Piecewise testable events. In Proceedings of the 2nd 
GI Conference on Automata Theory and Formal Languages, pages 
214-222, London, UK, 1975. Springer- Verlag. 

[Sim89] I. Simon. Properties of factorization forests. In Proceedings of the 
LITP Spring School on Theoretical Computer Science on Formal 
properties of finite automata and applications, pages 65-72, New 
York, NY, USA, 1989. Springer- Verlag New York, Inc. 

[Sim90] I. Simon. Factorization forests of finite height. Theor. Comput. 
Set., 72(l):65-94, 1990. 

[Sim92] I. Simon. A short proof of the factorization forest theorem. In 
M. Nivat and A. Podelski, editors, Tree Automata and Languages, 
pages 433-438. North-Holland, Amsterdam, 1992. 

[TT03] P. Tesson and D. Therien. Complete classifications for the commu- 
nication complexity of regular languages. In Theoretical Aspects of 
Computer Science, 2003. 

[TT04] P. Tesson and D. Therien. Monoids and computations. Interna- 
tional Journal of Algebra and Computation, 14:801-816, 2004. 

[TT05] Pascal Tesson and Denis Therien. Complete classifications for the 
communication complexity of regular languages. Theory Comput. 
Syst, 38(2):135-159, 2005. 



BIBLIOGRAPHY 



84 



[Yan91] M. Yannakakis. Expressing combinatorial optimization problems 
by linear programs. Journal of Computer and System Sciences, 
43(3):441-466, December 1991. 

[Yao79] A. C.-C. Yao. Some complexity questions related to distributive 
computing (preliminary report). In STOC ,r /9: Proceedings of the 
eleventh annual ACM symposium on Theory of computing, pages 
209-213, New York, NY, USA, 1979. ACM Press. 



